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SMAD ASSOCIATING POLYPEPTIDES 

Field of the Invention 

This invention relates to nucleic acids and encoded polypeptides which interact with 
5 Smad proteins. The invention also relates to agents which bind the nucleic acids or 
polypeptides. The invention further relates to methods of using such nucleic acids and 
polypeptides in the treatment and/or diagnosis of disease. 

Background of the Invention 

1 0 Members of the transforming growth factor-P (TGF-p) family are multifunctional 

cytokines with elicit a wide range of cellular effects, including growth inhibition, 
differentiation and apoptosis (Heldin et al., Nature 390:465-471, 1997). The signaling 
induced by TGF-p family members are initiated through a heteromeric transmembrane kinase 
complex that consists of type I and type II receptors. The activated type I receptor induces 

15 the phosphorylation of receptor-activated Smads (R-Smads) which heteromerize with Smad4. 
These complexes translocate from the cytoplasm to the nucleus to direct transcriptional 
regulation of responsive genes (Heldin et al., 1997). 

Recently, Smad6 and Smad7 were isolated, which form a subfamily among the Smads 
and function to inhibit the intracellular signaling by R-Smad/Smad4 complexes. Smad6 and 

20 Smad7 constitutively associate with type I receptor by blocking association and 

phosphorylation of R-Smads (Hayashi et al., Cell 89:1 165-1 173, 1997; Imamura et al., 
Nature 389:622-626, 1997; Nakao et al., Atawre 389:631-635, 1997). Smad6 and Smad7 are 
rapidly induced by members of the TGF-p family (Aftakhte et al., Biochem. Biophys. Res. 
Commun 249:505-51 1, 1998), suggesting that inhibitory Smads may take part in a negative 

25 feedback control mechanism to modulate the signaling induced by members of TGF-P 
family. 

The central role of Smads and TGF-p in cellular processes presents a need for 
additional factors to modulate Smads and TGF-p interactions with signal transduction 
pathways. 



Summary of the Invention 
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Using the yeast two hybrid system, proteins that specifically bind with Smad6 and 
Smad7 have been isolated. The invention provides these isolated Smad associating proteins 
(SAPs) and fragments of those molecules, as well as agents which bind such polypeptides, 
including antibodies. The invention also provides nucleic acid molecules encoding SAPs, 
5 unique fragments of those molecules, expression vectors containing the foregoing, and host 
cells transfected with those molecules. The foregoing can be used in the diagnosis or 
treatment of conditions characterized by the expression of a Smad associating protein, or in 
tbf ttieatment of ^onditiciis charscicH?e4 by the expression of a SAP, or *h th^^re^talent of k" 
concut> characterized by the expression of a Smad nucleic acid or polypeptide, or by the 
10 inadequate or excessive activity of a Smad polypeptide. The invention also provides methods 
for identifying pharmacological agents useful in the diagnosis or treatment of such 
conditions. Here, the identification of several SAPs is presented. The SAPs bind to Smad 
polypeptides including Smad6 and Smad7 and thus are components of TGF-P superfamily 
signaling pathways. 

15 According to one aspect of the invention, isolated nucleic acid molecules are 

provided. The isolated nucleic acid molecules are nucleic acid molecules which hybridize 
under stringent conditions to a molecule consisting of the nucleic acid sequence set forth in 
SEQ ID NO:3 or SEQ ID NO:5 and which code for a polypeptide which binds Smad6, or 
nucleic acid molecules that differ from the foregoing nucleic acid molecules in codon 

20 sequence due to the degeneracy of the genetic code, or complements of the foregoing nucleic 
acid molecules* Preferably the isolated nucleic acid molecule consists of SEQ ID NO:3 or 
SEQIDNO:5. 

According to another aspect of the invention, isolated nucleic acid molecules are 
provided which are unique fragments of nucleotides 1-2399 of SEQ ID NO:3 between 12 and 

25 2398 nucleotides in length or of nucleotides 1-855 of SEQ ID NO:5 between 12 and 854 
nucleotides in length. Also provided are complements of the foregoing unique fragments 
provided that the nucleic acid molecule excludes sequences consisting of GenBank accession 
numbers AF176069, AF293384, AA305358, AI2191 12, N33797 and AB030502. In certain 
embodiments, the isolated nucleic acid molecule consists of at least 22, 25, 30, 40, 50, 75 or 

30 100 contiguous nucleotides* In other embodiments, the isolated nucleic acid molecule 
consists of between 20 and 32 contiguous nucleotides. 
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According to still another aspect of the invention, expression vectors including any of 
the foregoing isolated nucleic acid molecules operably linked to a promoter are provided. 
Also provided are host cells transformed or transfected with the expression vectors, as well as 
transgenic non-human animals including the expression vectors. 
5 According to yet another aspect of the invention, methods for producing a polypeptide 

are provided. The methods include culturing the foregoing host cells under conditions which 
permit the expression of polypeptide. Preferably the methods include isolating the 
polypeptide!:"" ' : f^;r0^-^? ■ -r"\" t . - '• -^.^ 

In another aspect of the invention, isolated polypeptides are provided which are 
10 encoded by the foregoing isolated nucleic acid molecules. Preferred isolated polypeptides 
include molecules comprising the amino acid sequences of SEQ ID NO:4, SEQ ID NO:6, 
fragments or functional variants of SEQ ID NO:4, and a fragments or functional variants of 
SEQIDNO:6. 

According to still another aspect of the invention, isolated polypeptides are provided 

15 which include a fragment or functional variant of SEQ ID NO:2. In certain embodiments the 
fragment of SEQ ID NO:2 consists of amino acids 1-101+234-424, 106-424 or 234-424. 

According to yet another aspect of the invention, an isolated complex of polypeptides 
is provided. The complex includes one of the foregoing polypeptide bound to a polypeptide 
selected from the group consisting of Smad6, Smad7 and fragments thereof. 

20 Also included as an aspect of the invention are isolated polypeptides which bind 

selectively a polypeptide encoded by the foregoing isolated nucleic acid molecules, provided 
that the isolated polypeptide is not a Smad, STAM or cyclin polypeptide. In certain 
embodiments, the isolated polypeptide binds to an epitope defined by a polypeptide 
consisting of the sequence of SEQ ID NOs:2, 4 or 6. In other embodiments, the isolated 

25 polypeptide is an antibody fragment selected from the group consisting of a Fab fragment, a 
F(ab) 2 fragment or a fragment including a CDR3 region selective for a SAP polypeptide. In 
still other embodiments the isolated polypeptide is a monoclonal antibody, a humanized 
antibody or a chimeric antibody. 

According to still another aspect of the invention, methods for modulating TGF-P 

30 superfamily signal transduction in a mammalian cell are provided. The methods include 
contacting the mammalian cell with an amount of an agent which increases the amount of a 
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Smad associating protein selected from the group consisting of SAP1/AMSH (SEQ ID 
NO:2), SAP2 (SEQ ID NO:4), SAP3 (SEQ ID NO:6), Hsp40 homolog (U40992; SEQ ID 
NO:8), Uba80 (X63237; SEQ ID NO:10), Tax-1 binding protein (U33822; SEQ ID NO:12), 
rabaptin-5 (NM_004703; SEQ ID NO:14), and 26S proteinase S5a (U51007; SEQ ID NO:16) 
5 or a fragment thereof in the cell effective to reduce TGF-ji superfamily signal transduction in 
the mammalian cell. In certain embodiments, the agent is a nucleic acid molecule encoding 
one of the foregoing polypeptides. 

According to mother ' 
a mammalian cell are provided. The methods include contacting the mammalian cell with an 

1 0 amount of an agent which increases the amount of S AP2 (SEQ ID NO:4), or a fragment 

thereof, in the cell effective to bind a cyclin and regulate the cell cycle in the mammalian cell. 

In further aspects of the invention, methods for identifying lead compounds for a 
pharmacological agent are provided In certain embodiments, the methods include forming a 
mixture comprising a Smad6 or Smad7 polypeptide, a SAP polypeptide, and a candidate 

15 pharmacological agent, incubating the mixture under conditions which, in the absence of the 
candidate pharmacological agent, permit a first amount of specific binding of the SAP 
polypeptide by the Smad6 or Smad7 polypeptide, and detecting a test amount of the specific 
binding of the SAP polypeptide by the Smad6 or Smad7 polypeptide. A reduction of the test 
amount of specific binding relative to the first amount of specific binding indicates that the 

20 candidate pharmacological agent is a lead compound for a pharmacological agent which 
disrupts the Smad6-SAP or Smad7-SAP binding, and an increase of the test amount of 
specific binding relative to the first amount of specific binding indicates that the candidate 
pharmacological agent is a lead compound for a pharmacological agent which enhances the 
Smad6-SAP or Smad7-SAP binding. Preferably the SAP polypeptide is selected from the 

25 group consisting of S AP1/AMSH, SAP2, SAP3 and fragments thereof 

In other embodiments, the methods include forming a mixture comprising an ALK 
kinase, a Smad polypeptide, a SAP polypeptide, and a candidate pharmacological agent, 
incubating the mixture under conditions which, in the absence of the candidate 
pharmacological agent, permit a first amount of specific binding of the SAP polypeptide by 

30 the Smad polypeptide, and detecting a test amount of the specific binding of the SAP 
polypeptide by the Smad polypeptide. A reduction of the test amount of specific binding 
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relative to the first amount of specific binding indicates that the candidate pharmacological 
agent is a lead compound for a pharmacological agent which disrupts the Smad-SAP binding, 
and an increase of the test amount of specific binding relative to the first amount of specific 
binding indicates that the candidate pharmacological agent is a lead compound for a 
5 pharmacological agent which enhances the Smad-SAP binding. In preferred embodiments, 
the SAP polypeptide is selected from the group consisting of SAP1/AMSH, SAP2, SAP3 and 
fragments thereof, the Smad polypeptide is selected from the group consisting of Smad2, 
S.srodS, Smscl4, S^ad6, Stnad? and fragment? thereof, arid the ALK kinase 'is ' sLL-c-wrd i<rm. 
the group consisting of ALK5, constitutively activated ALK5, ALK6, constitutively activated 
10 ALK6 and fragments thereof having kinase activity. 

The use of the foregoing compositions in the preparation of a medicament is also 
contemplated. 

These and other aspects of the invention will be described in further detail in 
connection with the detailed description of the invention. 

15 

Brief Description of the Figures 

Fig. 1 depicts a schematic structures of Smad6S and Smad7 mutants used in the yeast 
two hybrid assay. The human short form of Smad6 and mouse Smad7 were inserted into 
pEG202. 

20 Fig. 2 shows the evaluation of Smad6-associating proteins using yeast two hybrid 

assay. Smad2, Smad4, Smad6S and Smad7 were used as baits to examine interaction with 
Smad6-associating proteins in yeast. 

Fig. 3 depicts a map of isolated AMSH (SAP1) clones. 
Fig. 4 shows a map of isolated SAP2 clones. 
25 Fig. 5 shows a schematic illustration of SAP 1/AMSH mutants. NLS, putative nuclear 

localization signal; PI and P2, proposed SH3 binding regions; JSH, JAB1 subdomain 
homologous regions. 

Brief Description of the Sequences 
30 SEQ ID NO: 1 is the nucleotide sequence of human S AP1/AMSH. 

SEQ ID NO:2 is the amino acid sequence of human SAP1/AMSH. 



WO 01/21794 PCT/US00/25790 

-6- 

SEQ ID NO:3 is the nucleotide sequence of human SAP2. 
SEQ ID NO:4 is the amino acid sequence of human SAP2. 
SEQ ID NO:5 is the nucleotide sequence of human SAP3. 
SEQ ID NO:6 is the amino acid sequence of human SAP3. 
5 SEQ ID NO:7 is the nucleotide sequence of the Hsp40 homolog having GenBank 

accession number U40992. 

SEQ ID NO:8 is the amino acid sequence of the Hsp40 homolog having GenBank 
ir,<t ^ accession number U4099? r v 7 ^ ^ ' 

SEQ ID NO:9 is the nucleotide sequence of Uba80, having GenBank accession 
10 number X63237. 

SEQ ID NO: 10 is the amino acid sequence of Uba80, having GenBank accession 
number X63237. 

SEQ ID NO:l 1 is the nucleotide sequence of Tax- 1 binding protein, having GenBank 
accession number U33822 
15 SEQ ID NO: 12 is the amino acid sequence of Tax- 1 binding protein, having GenBank 

accession number U33822. 

SEQ ID NO: 13 is the nucleotide sequence of rabaptin-5, having GenBank accession 
number NMJ)04703. 

SEQ ID NO: 14 is the amino acid sequence of rabaptin-5, having GenBank accession 
20 number NM 004703. 

SEQ ID NO: 15 is the nucleotide sequence of the 26S proteinase S5a, having GenBank 
accession number U51007. 

SEQ ID NO: 16 is the amino acid sequence of the 26S proteinase S5a, having 
GenBank accession number U51007. 

25 

Detailed Description of the Invention 

The present invention in one aspect involves the cloning of cDNAs encoding several 
Smad associating proteins (SAPs). The sequence of the human nucleic acids for SAP1 , SAP2 
and SAP3 are presented as SEQ ID NOs:l, 3 and 5, respectively, and the predicted amino 
30 acid sequences of the protein products are presented as SEQ ID NOs:2, 4 and 6. Analysis of 
the sequences by comparison to nucleic acid and protein databases determined that SAP1 
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corresponds to the human AMSH gene (GenBank accession numbers NM006463, U73522) 
and that SAP2 is related to aXenopus gene, XDRP1 (Funakoshi et al., EMBOJ. 18:5009- 
5018, 1 999). To the extent that the SAP polypeptides identified herein are similar to 
previously identified sequences, it is entirely unexpected that the polypeptides are binding 
5 partners for Smad proteins. 

The invention thus involves in one aspect SAP polypeptides, genes encoding those 
polypeptides, functional modifications and variants of the foregoing, useful fragments of the 
fcregoirrP as^i^l^i:h9iapeuHcs relating ifoeft-. The expression of -thr.se- gees * 
TGF-p super&mily signal transduction by binding to Smad polypeptides including tmad6 

10 and Smad7. The TGF-P superfamily members are well known to those of ordinary skill in 
the art and include TGF-ps, activins, bone morphogenetic proteins (BMPs), Vgl, Mullerian 
inhibitory substance (MIS) and growth/differentiation factors (GDFs). 

Homologs and alleles of the Smad associating protein-encoding nucleic acids of the 
invention can be identified by conventional techniques. Thus, an aspect of the invention is 

15 those nucleic acid sequences which code for SAP polypeptides and which hybridize to a 
nucleic acid molecule consisting of the coding region of SEQ ID NO: 1, SEQ ID NO:3 or 
SEQ ID NO:5, under stringent conditions. The term "stringent conditions" as used herein 
refers to parameters with which the art is familiar. Nucleic acid hybridization parameters 
may be found in references which compile such methods, e.g. Molecular Cloning: A 

20 Laboratory Manual, J. Sambrook, et al., eds., Second Edition, Cold Spring Harbor 

Laboratory Press, Cold Spring Harbor, New York, 1989, or Current Protocols in Molecular 
Biology, F.M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York. More specifically, 
stringent conditions, as used herein, refers, for example, to hybridization at 65°C in 
hybridization buffer (3.5 x SSC, 0.02% Ficoll, 0.02% polyvinyl pyrrolidone, 0,02% Bovine 

25 Serum Albumin, 2.5mM NaH 2 P0 4 (pH7), 0.5% SDS, 2mM EDTA). SSC is 0. 1 5M sodium 
chloride/0. 15M sodium citrate, pH7; SDS is sodium dodecyl sulphate; and EDTA is 
ethylenediaminetetracetic acid. After hybridization, the membrane upon which the DNA is 
transferred is washed at 2 x SSC at room temperature and then at 0.1 - 0.5 X SSC/0.1 x SDS 
at temperatures up to 68°C. 

30 There are other conditions, reagents, and so forth which can be used, which result in a 

similar degree of stringency. The skilled artisan will be familiar with such conditions, and 
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thus they are not given here. It will be understood, however, that the skilled artisan will be 
able to manipulate the conditions in a manner to permit the clear identification of homologs 
and alleles of SAP nucleic acids of the invention. The skilled artisan also is familiar with the 
methodology for screening cells and libraries for expression of such molecules which then are 
routinely isolated, followed by isolation of the pertinent nucleic acid molecule and 
sequencing. 

In general homologs and alleles typically will share at least 75% nucleotide identity 
and/or at least 90% oinfco soididtentity tb SEQ ID NO&l; 3 or 5 and SEQ IB NO?:2; 4 <*r 5, 
respectively, in some instances will share at least 90% nucleotide identity and/or at least 95% 
amino acid identity and in still other instances will share at least 95% nucleotide identity 
and/or at least 99% amino acid identity. The homology can be calculated using various, 
publicly available software tools developed by NCBI (Bethesda, Maryland) that can be 
obtained through the Internet (ftp:/ncbi.nlm.nih.gov/pub/). Exemplary tools include the 
BLAST system available at http://www.ncbi.nlm.nih.gov, preferably using default settings. 
Pairwise and ClustalW alignments (BLOSUM30 matrix setting) as well as Kyle-Doolittle 
hydropathic analysis can be obtained using the Mac Vector sequence analysis software 
(Oxford Molecular Group). Watson-Crick complements of the foregoing nucleic acids also 
are embraced by the invention. 

In screening for nucleic acids encoding Smad associating proteins with sequence 
homology to the SAP nucleic acids described herein, a Southern blot may be performed using 
the foregoing conditions, together with a radioactive probe. After washing the membrane to 
which the DNA is finally transferred, the membrane can be placed against X-ray film to 
detect the radioactive signal. 

As used herein with respect to nucleic acids, the term "isolated" means: (i) amplified 
in vitro by, for example, polymerase chain reaction (PCR); (ii) recombinantly produced by 
cloning; (iii) purified, as by cleavage and gel separation; or (iv) synthesized by, for example, 
chemical synthesis. An isolated nucleic acid is one which is readily manipulable by 
recombinant DNA techniques well known in the art. Thus, a nucleotide sequence contained 
in a vector in which 5' and 3' restriction sites are known or for which polymerase chain 
reaction (PCR) primer sequences have been disclosed is considered isolated but a nucleic acid 
sequence existing in its native state in its natural host is not. An isolated nucleic acid may be 
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substantially purified, but need not be. For example, a nucleic acid that is isolated within a 
cloning or expression vector is not pure in that it may comprise only a tiny percentage of the 
material in the cell in which it resides. Such a nucleic acid is isolated, however, as the term i 
used herein because it is readily manipulable by standard techniques known to those of 
ordinary skill in the art. An isolated nucleic acid as used herein is not a naturally occurring 
chromosome. 

As used herein with respect to polypeptides, "isolated" means separated from its 

environment and preseniinsuffic^^ 
Isolated, when referring to a protein or polypeptide, means, for example: (i) selectively 
produced by expression cloning or (ii) purified as by chromatography or electrophoresis. 
Isolated proteins or polypeptides may be, but need not be, substantially pure. The term 
"substantially pure" means that the proteins or polypeptides are essentially free of other 
substances with which they may be found in nature or in vivo systems to an extent practical 
and appropriate for their intended use. Substantially pure polypeptides may be produced by 
techniques well known in the art. Because an isolated protein may be admixed with a 
pharmaceutically acceptable carrier in a pharmaceutical preparation, the protein may 
comprise only a small percentage by weight of the preparation. The protein is nonetheless 
isolated in that it has been separated from the substances with which it may be associated in 
living systems, i.e. isolated from other proteins. 

The invention also includes degenerate nucleic acids which include alternative codons 
to those present in the native materials. For example, serine residues are encoded by the 
codons TCA, AGT, TCC, TCG, TCT and AGC. Each of the six codons is equivalent for the 
purposes of encoding a serine residue. Thus, it will be apparent to one of ordinary skill in the 
art that any of the serine-encoding nucleotide triplets may be employed to direct the protein 
synthesis apparatus, in vitro or in vzvo, to incorporate a serine residue into an elongating 
Smad7 polypeptide. Similarly, nucleotide sequence triplets which encode other amino acid 
residues include, but are not limited to: CCA, CCC, CCG and CCT (proline codons); CGA, 
CGC, CGG, CGT, AGA and AGG (arginine codons); ACA, ACC, ACG and ACT (threonine 
codons); AAC and AAT (asparagine codons); and ATA, ATC and ATT (isoleucine codons). 
Other amino acid residues may be encoded similarly by multiple nucleotide sequences. Thus, 
the invention embraces degenerate nucleic acids that differ from the biologically isolated 
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nucleic acids in codon sequence due to the degeneracy of the genetic code. 

The invention also provides modified nucleic acid molecules which include additions, 
substitutions and deletions of one or more nucleotides. In preferred embodiments, these 
modified nucleic acid molecules and/or the polypeptides they encode retain at least one 
activity or function of the unmodified nucleic acid molecule and/or the polypeptides, such as 
Smad binding, antigenicity, enzymatic activity, receptor binding, etc. In certain 
embodiments, the modified nucleic acid molecules encode modified polypeptides, preferably 
polypeptides having e^^ervative amino ^cid svfelit^o^s as are desciibed elsewhere herd-: 
The modified nucleic acid molecules are structurally relatea to the unmodified nucleic acid 
molecules and in preferred embodiments are sufficiently structurally related to the 
unmodified nucleic acid molecules so that the modified and unmodified nucleic acid 
molecules hybridize under stringent conditions known to one of skill in the art. 

For example, modified nucleic acid molecules which encode polypeptides having 
single amino acid changes can be prepared. Each of these nucleic acid molecules can have 
one, two or three nucleotide substitutions exclusive of nucleotide changes corresponding to 
the degeneracy of the genetic code as described herein. Likewise, modified nucleic acid 
molecules which encode polypeptides having two amino acid changes can be prepared which 
have, e.g., 2-6 nucleotide changes. Numerous modified nucleic acid molecules like these will 
be readily envisioned by one of skill in the art, including for example, substitutions of 
nucleotides in codons encoding amino acids 2 and 3, 2 and 4, 2 and 5, 2 and 6, and so on. In 
the foregoing example, each combination of two amino acids is included in the set of 
modified nucleic acid molecules, as well as all nucleotide substitutions which code for the 
amino acid substitutions. Additional nucleic acid molecules that encode polypeptides having 
additional substitutions (i.e., 3 or more), additions or deletions (e.g., by introduction of a stop 
codon or a splice site(s)) also can be prepared and are embraced by the invention as readily 
envisioned by one of ordinary skill in the art. Any of the foregoing nucleic acids or 
polypeptides can be tested by routine experimentation for retention of structural relation or 
activity to the nucleic acids and/or polypeptides disclosed herein. 

The invention also provides isolated unique fragments of SEQ ID NOs:l, 3 or 5 or 
complements of SEQ ID NOs: 1 , 3 or 5 of sufficient length to represent a sequence unique 
within the human genome, and identifying a nucleic acid encoding a Smad associating 
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polypeptide. A unique fragment is one that is a 'signature' for the larger nucleic acid. It, for 
example, is long enough to assure that its precise sequence is not found in molecules outside 
of the SAP nucleic acids defined above. A unique fragment includes a sequence of 
contiguous nucleotides which is not identical to any sequence selected from the sequence 
5 group consisting of (1) sequences having the GenBank accession numbers AF1 76069, 
AF293384, AA305358, AI2191 12, N33797, AB030502 and other sequences publicly 
available as of the filing date of this application, (2) complements of (1), and (3) fragments of 
(l)^and <2): Thus a unique fragment excludes, by definition, sequent W^^g ^kly o£ 
EST and/or gene sequences such as those described by GenBank accession numbers 

10 AF176069,AF293384,AA305358,AI219112,N33797andAB030502. 

A fragment which is completely composed of the sequence described in the foregoing 
GenBank deposits is one which does not include any of the nucleotides unique to the 
sequences of the invention. Thus, a unique fragment must contain a nucleotide sequence 
other than the exact sequence of those in GenBank or fragments thereof. The difference may 

15 be an addition, deletion or substitution with respect to the GenBank sequence or it may be a 
sequence wholly separate from the GenBank sequence. 

Unique fragments can be used as probes in Southern blot assays to identify such 
nucleic acids, or can be used in amplification assays such as those employing PCR. As 
known to those skilled in the art, large probes such as 200 250, 300 or more nucleotides are 

20 preferred for certain uses such as Southern blots, while smaller fragments will be preferred 
for uses such as PCR. Unique fragments also can be used to produce fusion proteins for 
generating antibodies or determining binding of the polypeptide fragments, as demonstrated 
in the Examples, or for generating immunoassay components. Likewise, unique fragments 
can be employed to produce nonfused fragments of the SAP polypeptides such as the N- 

25 terminal and C-terminal fragments disclosed herein, useful, for example, in the preparation of 
antibodies, in immunoassays, and as a competitive binding partner of the S APs and/or other 
polypeptides which bind to Smad 6 or Smad7 polypeptides, for example, in therapeutic 
applications. Unique fragments further can be used as antisense molecules to inhibit the 
expression of SAP nucleic acids and polypeptides, particularly for therapeutic purposes as 

30 described in greater detail below. 

As will be recognized by those skilled in the art, the size of the unique fragment will 
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depend upon its conservancy in the genetic code. Thus, some regions of SEQ ID NOs:l, 3 
and/or SEQ ID NO:5 and its complement will require longer segments to be unique while 
others will require only short segments, typically between 12 and 32 nucleotides (e.g. 12, 13, 
14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31 and 32 bases long). This 
5 disclosure intends to embrace each and every fragment of each sequence, beginning at the 
first nucleotide, the second nucleotide and so on, up to 8 nucleotides short of the end, and 
ending anywhere from nucleotide number 8, 9, 10 and so on for each sequence, rp to the very 
lasr < ?ucbet : (pw^tdM the sequence is uniqtk^.^scribed above V Msay sc^-: ii§%PS^0 * 
ID NO:3 or SEQ ID NO:5, or complements thereof, that are 25 or more nucleotides in length 

10 will be unique. Those skilled in the art are well versed in methods for selecting such 
sequences, typically on the basis of the ability of the unique fragment to selectively 
distinguish the sequence of interest from non-SAP nucleic acids. A comparison of the 
sequence of the fragment to those on known data bases typically is all that is necessary, 
although in vitro confirmatory hybridization and sequencing analysis may be performed. 

15 A unique fragment can be a functional fragment. A functional fragment of a nucleic 

acid molecule of the invention is a fragment which retains some functional property of the 
larger nucleic acid molecule, such as coding for a functional polypeptide, binding to proteins 
(e.g., Smads), regulating transcription of operably linked nucleic acids, and the like. One of 
ordinary skill in the art can readily determine using the assays described herein and those well 

20 known in the art to determine whether a fragment is a functional fragment of a nucleic acid 
molecule using no more than routine experimentation. 

As mentioned above, the invention embraces antisense oligonucleotides that 
selectively bind to a nucleic acid molecule encoding a SAP polypeptide, to modulate TGF-p, 
activin and/or BMP signaling by reducing the amount of SAPs. This is desirable in virtually 

25 any medical condition wherein a reduction of SAP binding to Smad proteins is desirable, e.g., 
to modulate Smad activity such as in TGF-p signaling. 

As used herein, the term "antisense oligonucleotide" or "antisense" describes an 
oligonucleotide that is an oligoribonucleotide, oligodeoxyribonucleotide, modified 
oligoribonucleotide, or modified oligodeoxyribonucleotide which hybridizes under 

30 physiological conditions to DNA comprising a particular gene or to an mRNA transcript of 
that gene and, thereby, inhibits the transcription of that gene and/or the translation of that 
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mRNA. The antisense molecules are designed so as to interfere with transcription or 
translation of a target gene upon hybridization with the target gene or transcript. Those 
skilled in the art will recognize that the exact length of the antisense oligonucleotide and its 
degree of complementarity with its target will depend upon the specific target selected, 
5 including the sequence of the target and the particular bases which comprise that sequence. It 
is preferred that the antisense oligonucleotide be constructed and arranged so as to bind 
selectively with the target under physiological conditions, i.e., to hybridize substantially more 

; to the target seqi;en* e ^ 
conditions. Based upon SEQ ID NOs:l, 3 or 5, or upon allelic or homologous genomic 

1 0 and/or cDNA sequences, or upon the nucleotide sequences of other Smad associating 

polypeptides disclosed herein, one of skill in the art can easily choose and synthesize any of a 
number of appropriate antisense molecules for use in accordance with the present invention. 
For example, a "gene walk" comprising a series of oligonucleotides of 15-30 nucleotides 
spanning the length of a SAP nucleic acid can be prepared, followed by testing for inhibition 

15 of SAP expression. Optionally, gaps of 5-1 0 nucleotides can be left between the 
oligonucleotides to reduce the number of oligonucleotides synthesized and tested. 

In order to be sufficiently selective and potent for inhibition, such antisense 
oligonucleotides should comprise at least 10 and, more preferably, at least 15 consecutive 
bases which are complementary to the target, although in certain cases modified 

20 oligonucleotides as short as 7 bases in length have been used successfully as antisense 

oligonucleotides (Wagner et al., Nature Biotechnol 14:840-844, 1996). Most preferably, the 
antisense oligonucleotides comprise a complementary sequence of 20-30 bases. Although 
oligonucleotides may be chosen which are antisense to any region of the gene or mKNA 
transcripts, in preferred embodiments the antisense oligonucleotides correspond to N-terminal 

25 or 5' upstream sites such as translation initiation, transcription initiation or promoter sites. In 
addition, 3 '-untranslated regions may be targeted. Targeting to mRNA splicing sites has also 
been used in the art but may be less preferred if alternative mRNA splicing occurs. In 
addition, the antisense is targeted, preferably, to sites in which mRNA secondary structure is 
not expected (see, e.g., Sainio et al., Cell MoL Neurobiol 14(5):43 9-457, 1994) and at which 

30 proteins are not expected to bind. Finally, although SEQ ID Nos: 1 , 3 or 5 disclose cDNA 
sequences, one of ordinary skill in the art may easily derive the genomic DNA corresponding 
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to the cDNA of SEQ ID Nos: 1 , 3 or 5. Thus, the present invention also provides for antisense 
oligonucleotides which are complementary to the genomic DNA corresponding to SEQ ID 
Nos: 1, 3 or 5. Similarly, antisense to allelic or homologous cDNAs and genomic DNAs are 
enabled without undue experimentation. 
5 In one set of embodiments, the antisense oligonucleotides of the invention may be 

composed of "natural" deoxyribonucleotides, ribonucleotides, or any combination thereof. 
That is, the 5' end of one native nucleotide and the 3* end of another native nucleotide may be 
t^v^w^Iy These 
oligonucleotides may be prepared by art recognised methods which may be carried out 
10 manually or by an automated synthesizer. They also may be produced recombinantly by 
vectors. 

In preferred embodiments, however, the antisense oligonucleotides of the invention 
also may include "modified" oligonucleotides. That is, the oligonucleotides may be modified 
in a number of ways which do not prevent them from hybridizing to their target but which 

15 enhance their stability or targeting or which otherwise enhance their therapeutic effectiveness. 

The term "modified oligonucleotide" as used herein describes an oligonucleotide in 
which (1) at least two of its nucleotides are covalently linked via a synthetic internucleoside 
linkage (i.e., a linkage other than a phosphodiester linkage between the 5* end of one 
nucleotide and the 3 1 end of another nucleotide) and/or (2) a chemical group not normally 

20 associated with nucleic acids has been covalently attached to the oligonucleotide. Preferred 
synthetic internucleoside linkages are phosphorothioates, alkylphosphonates, 
phosphorodithioates, phosphate esters, alkylphosphonothioates, phosphoramidates, 
carbamates, carbonates, phosphate triesters, acetamidates, carboxymethyl esters and peptides. 
The term "modified oligonucleotide" also encompasses oligonucleotides with a 

25 covalently modified base and/or sugar. For example, modified oligonucleotides include 
oligonucleotides having backbone sugars which are covalently attached to low molecular 
weight organic groups other than a hydroxyl group at the 3* position and other than a 
phosphate group at the 5' position. Thus modified oligonucleotides may include a 2'-0- 
alkylated ribose group. In addition, modified oligonucleotides may include sugars such as 

30 arabinose instead of ribose. The present invention, thus, contemplates pharmaceutical 
preparations containing modified antisense molecules that are complementary to and 
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hybridizable with, under physiological conditions, nucleic acids encoding SAP polypeptides, 
together with pharmaceutically acceptable carriers. 

Antisense oligonucleotides may be administered as part of a pharmaceutical 
composition. Such a pharmaceutical composition may include the antisense oligonucleotides 
5 in combination with any standard physiologically and/or pharmaceutically acceptable carriers 
which are known in the art. The compositions should be sterile and contain a therapeutically 
effective amount of the antisense oligonucleotides in a unit of weight or volume suitable for 
, a Lmnistration to a patient The term '^armac-eid^y. acceptable" me&>*s & ftou--toxic 

material that does not interfere with the effectiveness of the biological activity of the active 

10 ingredients. The term "physiologically acceptable" refers to a non-toxic material that is 
compatible with a biological system such as a cell, cell culture, tissue, or organism. The 
characteristics of the carrier will depend on the route of administration. Physiologically and 
pharmaceutically acceptable carriers include diluents, fillers, salts, buffers, stabilizers, 
solubilizers, and other materials which are well known in the art. 

15 As used herein, a "vector" may be any of a number of nucleic acids into which a 

desired sequence may be inserted by restriction and ligation for transport between different 
genetic environments or for expression in a host cell. Vectors are typically composed of 
DNA although RNA vectors are also available. Vectors include, but are not limited to, 
plasmids, phagemids and virus genomes. A cloning vector is one which is able to replicate in 

20 a host cell, and which is further characterized by one or more endonuclease restriction sites at 
which the vector may be cut in a determinable fashion and into which a desired DNA 
sequence may be ligated such that the new recombinant vector retains its ability to replicate 
in the host cell. In the case of plasmids, replication of the desired sequence may occur many 
times as the plasmid increases in copy number within the host bacterium or just a single time 

25 per host before the host reproduces by mitosis. In the case of phage, replication may occur 
actively during a lytic phase or passively during a lysogenic phase. An expression vector is 
one into which a desired DNA sequence may be inserted by restriction and ligation such that 
it is operably joined to regulatory sequences and may be expressed as an RNA transcript. 
Vectors may further contain one or more marker sequences suitable for use in the 

30 identification of cells which have or have not been transformed or transfected with the vector. 
Markers include, for example, genes encoding proteins which increase or decrease either 
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resistance or sensitivity to antibiotics or other compounds, genes which encode enzymes 
whose activities are detectable by standard assays known in the art (e.g., p-galactosidase, 
luciferase or alkaline phosphatase), and genes which visibly affect the phenotype of 
transformed or transfected cells, hosts, colonies or plaques (e.g., green fluorescent protein). 
5 Preferred vectors are those capable of autonomous replication and expression of the structural 
gene products present in the DNA segments to which they are operably joined. 

As used herein, a coding sequence and regulatory sequences are said to be "operably" 

■ . ■ j®hjed*when they &rs covalently \u vfccd in such a way as to p.5&ee ilr e^r^tfifer 
transcription of the coding sequence under the influence or control of the legulatory 

1 0 sequences. If it is desired that the coding sequences be translated into a functional protein, 
two DNA sequences are said to be operably joined if induction of a promoter in the 5* 
regulatory sequences results in the transcription of the coding sequence and if the nature of 
the linkage between the two DNA sequences does not (1) result in the introduction of a 
frame-shift mutation, (2) interfere with the ability of the promoter region to direct the 

15 transcription of the coding sequences, or (3) interfere with the ability of the corresponding 
RNA transcript to be translated into a protein. Thus, a promoter region would be operably 
joined to a coding sequence if the promoter region were capable of effecting transcription of 
that DNA sequence such that the resulting transcript might be translated into the desired 
protein or polypeptide. 

20 The precise nature of the regulatory sequences needed for gene expression may vary 

between species or cell types, but shall in general include, as necessary, 5' non-transcribed 
and 5* non-translated sequences involved with the initiation of transcription and translation 
respectively, such as a TATA box, capping sequence, CAAT sequence, and the like. 
Especially, such 5' non-transcribed regulatory sequences will include a promoter region 

25 which includes a promoter sequence for transcriptional control of the operably joined gene. 
Regulatory sequences may also include enhancer sequences or upstream activator sequences 
as desired. The vectors of the invention may optionally include 5 1 leader or signal sequences. 
The choice and design of an appropriate vector is within the ability and discretion of one of 
ordinary skill in the art. 

30 Expression vectors containing all the necessary elements for expression are 

commercially available and known to those skilled in the art. See, e.g., Sambrook et al., 
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Molecular Cloning; A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory 
Press, 1989. Cells are genetically engineered by the introduction into the cells of 
heterologous DNA (RNA) encoding a SAP polypeptide or fragment or variant thereof. That 
heterologous DNA (RNA) is placed under operable control of transcriptional elements to 
5 permit the expression of the heterologous DNA in the host cell. 

Preferred systems for mRNA expression in mammalian cells are those such as 
pRc/CMV (available from Invitrogen, Carlsbad, CA) that contain a selectable marker such as 
a gcnr. fe- ^ 0Rfers^©418 resistance (which facilitates the selection of steb'y tr^ y ctiid%^ tj 
lines) and the human cytomegalovirus (CMV) enhancer-promoter sequences. Addition^ ?v, 

10 suitable for expression in primate or canine cell lines is the pCEP4 vector (Invitrogen), which 
contains an Epstein Barr virus (EBV) origin of replication, facilitating the maintenance of 
plasmid as a multicopy extrachromosomal element. Another expression vector is the pEF- 
BOS plasmid containing the promoter of polypeptide Elongation Factor la, which stimulates 
efficiently transcription in vitro. The plasmid is described by Mishizuma and Nagata (Nuc. 

15 Acids Res. 1 8:5322, 1990), and its use in transfection experiments is disclosed by, for 

example, Demoulin (Mol Cell Biol 16:4710-4716, 1996). Still another preferred expression 
vector is an adenovirus, described by Stratford-Perricaudet, which is defective for El and E3 
proteins (J. Clin Invest. 90:626-630, 1992). The use of the adenovirus as an Adeno.Pl A 
recombinant is disclosed by Warnier et al., in intradermal injection in mice for immunization 

20 against PI A {Int. J. Cancer, 67:303-310, 1996). 

The invention also embraces so-called expression kits, which allow the artisan to 
prepare a desired expression vector or vectors. Such expression kits include at least separate 
portions of each of the previously discussed coding sequences. Other components may be 
added, as desired, as long as the previously mentioned sequences, which are required, are 

25 included. 

The invention also permits the construction of SAP gene "knock-outs" in cells and in 
animals, providing materials for studying certain aspects of TGF-p, activin and/or BMP 
signal transduction. 

The invention also provides isolated polypeptides, which include the polypeptides of 
30 SEQ ID NOs:2, 4 and 6 and unique fragments of SEQ ID NOs:2, 4 and 6 including fragments 
shown in Fig. 5 (amino acids 1-226/232-424, 1-194/234-424, 1-233, 1-322/370-424, 1- 
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1 1 1/128-424, 1-1017234-424, 106-424 and 234-424 of SEQ ID NO:2). Such polypeptides are 
useful, for example, alone or as fusion proteins to test Smad binding, to test phosphorylation, 
to generate antibodies, and as a components of an immunoassay. 

A unique fragment of a SAP polypeptide, in general, has the features and 
5 characteristics of unique fragments as discussed above in connection with nucleic acids. As 
will be recognized by those skilled in the art, the size of the unique fragment will depend 
upon factors such as \yhether the fragment constitutes a portion of a conserved protein 
. .■- domain. Thus, some re^on^|SEQ^D.NOs^,,.4 and/cr € *viU r^ire longer ^mc;.^ fc- be " ■mm^^^ ' 
unique while others will require only short segments, typically between 5 and 12 amino acids 

10 (e.g. 5, 6, 7, 8, 9, 1 0, 1 1 and 12 amino acids long). Virtually any segment of SEQ ID NOs:4 
and 6 that is 10 or more amino acids in length will be unique. 

Unique fragments of a polypeptide preferably are those fragments which retain a 
distinct functional capability of the polypeptide. Functional capabilities which can be 
retained in a unique fragment of a polypeptide include binding of Smad6 and/or Smad7, 

15 interaction with antibodies, interaction with other polypeptides (such as TpR-I) or fragments 
thereof, selective binding of nucleic acids or proteins, and enzymatic activity. For example, 
as exemplified herein, N-terminal and C-terminal SAP1/AMSH fragments such as those 
depicted in Fig. 5 can be used as a fiinctional equivalent of full length SAP1/AMSH in the 
methods of the invention, including e.g., binding of Smads for modulation of TGF-P signal 

20 transduction. Other SAP polypeptide fragments, e.g., other N-terminal or C-terminal 
fragments, can be selected according to their functional properties. For example, one of 
ordinary skill in the art can prepare SAP fragments recombinantly and test those fragments 
according to the methods exemplified below, such as binding to a Smad polypeptide. Those 
skilled in the art also are well versed in methods for selecting unique amino acid sequences, 

25 typically on the basis of the ability of the unique fragment to selectively distinguish the 
sequence of interest from non-family members. A comparison of the sequence of the 
fragment to those on known data bases typically is all that is necessary. 

The invention embraces variants of the SAP polypeptides described above. As used 
herein, a "variant" of a SAP polypeptide is a polypeptide which contains one or more 

30 modifications to the primary amino acid sequence of a SAP polypeptide. Modifications 
which create a SAP variant can be made to a SAP polypeptide 1) to reduce or eliminate an 
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activity of a SAP polypeptide, such as binding to a Smad polypeptide; 2) to enhance a 
property of a SAP polypeptide, such as protein stability in an expression system or the 
stability of protein-protein binding; or 3) to provide a novel activity or property to a SAP 
polypeptide, such as addition of an antigenic epitope or addition of a detectable moiety. 
5 Modifications to a SAP polypeptide are typically made to the nucleic acid which encodes the 
SAP polypeptide, and can include deletions, point mutations, truncations, amino acid 
substitutions and additions of amino acids or non-amino acid moieties. Alternatively, 

linker molecule, addition of a detectable moiety, such as biotin, addition of a fatty acid, and 
10 the like. Modifications also embrace fusion proteins comprising all or part of the SAP amino 
acid sequence. 

In general, variants include SAP polypeptides which are modified specifically to alter 
a feature of the polypeptide unrelated to its physiological activity. For example, cysteine 
residues can be substituted or deleted to prevent unwanted disulfide linkages. Similarly, 

1 5 certain amino acids can be changed to enhance expression of a SAP polypeptide by 
eliminating proteolysis by proteases in an expression system (e.g., dibasic amino acid 
residues in yeast expression systems in which KEX2 protease activity is present). 

Mutations of a nucleic acid which encode a Smad7 polypeptide preferably preserve 
the amino acid reading frame of the coding sequence, and preferably do not create regions in 

20 the nucleic acid which are likely to hybridize to form secondary structures, such a hairpins or 
loops, which can be deleterious to expression of the variant polypeptide. 

Mutations can be made by selecting an amino acid substitution, or by random 
mutagenesis of a selected site in a nucleic acid which encodes the polypeptide. Variant 
polypeptides are then expressed and tested for one or more activities to determine which 

25 mutation provides a variant polypeptide with the desired properties. Further mutations can be 
made to variants (or to non-variant SAP polypeptides) which are silent as to the amino acid 
sequence of the polypeptide, but which provide preferred codons for translation in a particular 
host. The preferred codons for translation of a nucleic acid in, e.g., E. coli, are well known to 
those of ordinary skill in the art. Still other mutations can be made to the noncoding 

30 sequences of a SAP gene or cDNA clone to enhance expression of the polypeptide. The 
activity of variants of SAP polypeptides can be tested by cloning the gene encoding the 
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variant SAP polypeptide into a bacterial or mammalian expression vector, introducing the 
vector into an appropriate host cell, expressing the variant SAP polypeptide, and testing for a 
functional capability of the SAP polypeptides as disclosed herein. For example, the variant 
Smad? polypeptide can be tested for Smad binding as disclosed in the Examples. Preparation 
5 of other variant polypeptides may favor testing of other activities, as will be known to one of 
ordinary skill in the art. 

The skilled artisan will also realize that conservative amino acid substitutions may be 
■^wm^e^ y ' - : mMt ?n SAP poll peptides to provide functionally ^quivalent^^ 

polypeptides, i.e, the variants retain the functional capabilities of the SAP polypeptides. As 

10 used herein, a "conservative amino acid substitution" refers to an amino acid substitution 
which does not alter the relative charge or size characteristics of the protein in which the 
amino acid substitution is made. Variants can be prepared according to methods for altering 
polypeptide sequence known to one of ordinary skill in the art such as are found in references 
which compile such methods, e.g. Molecular Cloning: A Laboratory Manual, J. Sambrook, et 

15 al., eds., Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York, 1989, or Current Protocols in Molecular Biology, F.M. Ausubel, et aL, eds., John 
Wiley & Sons, Inc., New York. Exemplary functionally equivalent variants of the SAP 
polypeptides include conservative amino acid substitutions of SEQ ID NOs:2, 4 or 6. 
Conservative substitutions of amino acids include substitutions made amongst amino acids 

20 within the following groups: (a) M, I, L, V; (b) F, Y, W; (c) K, R, H; (d) A, G; (e) S, T; (f) Q, 
N; and(g)E, D. 

Conservative amino-acid substitutions in the amino acid sequence of SAP 
polypeptides to produce functionally equivalent variants of SAP polypeptides typically are 
made by alteration of a nucleic acid encoding a SAP polypeptide (SEQ ID NOs:l, 3 and 5). 

25 Such substitutions can be made by a variety of methods known to one of ordinary skill in the 
art. For example, amino acid substitutions may be made by PCR-directed mutation, site- 
directed mutagenesis according to the method of Kunkel (Kunkel, Proc. Nat. Acad Set. 
U.S. A. 82: 488-492, 1985), or by chemical synthesis of a gene encoding a SAP polypeptide. 
Where amino acid substitutions are made to a small unique fragment of a SAP polypeptide, 

30 such as a Smad or SH3 binding site peptide, the substitutions can be made by directly 
synthesizing the peptide. The activity of functionally equivalent fragments of SAP 
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polypeptides can be tested by cloning the gene encoding the altered SAP polypeptide into a 
bacterial or mammalian expression vector, introducing the vector into an appropriate host 
cell, expressing the altered SAP polypeptide, and testing for a functional capability of the 
SAP polypeptides as disclosed herein. Peptides which are chemically synthesized can be 

5 tested directly for function, e.g., for binding to Smad6 and/or Smad7. 

The invention as described herein has a number of uses, some of which are described 
elsewhere herein. First, the invention permits isolation of the SAP protein molecules (SEQ 
%$$iM&&2, 4 ■ssjSfl.iS) A varietyiof methodologies we^ kncw;iM skiUe#fc^^ 
utilized to obtain isolated SAP molecules. The polypeptide may be puffed from cells which 

10 naturally produce the polypeptide by chromatographic means or immunological recognition. 
Alternatively, an expression vector may be introduced into cells to cause production of the 
polypeptide. In another method, mRNA transcripts may be microinjected or otherwise 
introduced into cells to cause production of the encoded polypeptide. Translation of mRNA 
in cell-free extracts such as the reticulocyte lysate system also may be used to produce 

15 polypeptide. Those skilled in the art also can readily follow known methods for isolating 
SAP polypeptides. These include, but are not limited to, immunochromatography, HPLC, 
size-exclusion chromatography, ion-exchange chromatography and immune-affinity 
chromatography. 

The isolation of the SAP gene also makes it possible for the artisan to diagnose a 
20 disorder characterized by expression of SAP. These methods involve determining expression 
of the SAP gene, and/or SAP polypeptides derived therefrom. In the former situation, such 
determinations can, be carried out via any standard nucleic acid determination assay, including 
the polymerase chain reaction as exemplified in the examples below, or assaying with labeled 
hybridization probes. 

25 The invention also makes it possible isolate proteins such as Smad6 and Smad7 by the 

binding of such proteins to SAP as disclosed herein. The identification of this binding by 
SAP1, for example, also permits one of skill in the art to block the binding of Smad7 or 
Smad7 to other Smad-binding proteins, such as other SAPs, such as SAP2 or SAP3. Other 
SAPs can likewise by used to modulate protein binding to Smads. Binding of the proteins 

30 can be effected by introducing into a biological system in which the proteins bind (e.g., a cell) 
a SAP polypeptide including a Smad6 or Smad7 binding site in an amount sufficient to block 
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the binding. The identification of Smad binding sites in SAPs also enables one of skill in the 
art to prepare modified proteins, using standard recombinant DNA techniques, which can 
bind to proteins such as Smad6 and Smad7. For example, when one desires to target a certain 
protein to a Smad6 or Smad7 protein complex, one can prepare a fusion polypeptide of the 
5 protein and a SAP protein or a fragment thereof having a Smad binding site. Additional uses 
are described herein. 

The invention further provides methods for reducing or increasing TGF-p family 
signal tr^s^tjoifeto 'a'cdL^'Siiph'fe^cdis"^ useful in vitro for altering t?i TGS^sip^l 
transduction, ^example, in testing compounds for potential to block aberrant TGI p signal 

10 transduction or increase deficient TGF-P signal transduction. In v/vo, such methods are 

useful for modulating growth, e.g., to treat cancer and fibrosis. Such methods also are useful 
in the treatment of conditions which result from excessive or deficient TGF-P signal 
transduction. TGF-p signal transduction can be measured by a variety of ways known to one 
of ordinary skill in the art, such as the reporter systems described in the references cited in the 

15 Examples. Various modulators of SAP protein activity can be screened for effects on TGF-p 
signal transduction using the methods disclosed herein. The skilled artisan can first 
determine the modulation of a SAP activity, such as Smad binding or TGF-p signaling 
activity, and then apply such a modulator to a target cell or subject and assess the effect on 
the target cell or subject. For example, in screening for modulators of SAPs useful in the 

20 treatment of cancer, cells in culture can be contacted with SAP modulators and the increase or 
decrease of growth or focus formation of the cells can be determined according to standard 
procedures. SAP activity modulators can be assessed for their effects on other TGF-P signal 
transduction downstream effects by similar methods in many cell types. The foregoing also 
applies to signaling via activin and BMP complexes. 

25 The invention also provides, in certain embodiments, "dominant negative" 

polypeptides derived from SEQ ID NOs:2, 4 and/or 6. A dominant negative polypeptide is an 
inactive variant of a protein, which, by interacting with the cellular machinery, displaces an 
active protein from its interaction with the cellular machinery or competes with the active 
protein, thereby reducing the effect of the active protein. For example, a dominant negative 

30 receptor which binds a ligand but does not transmit a signal in response to binding of the 
ligand can reduce the biological effect of expression of the ligand. Likewise, a dominant 
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negative catalytically-inactive kinase which interacts normally with target proteins but does 
not phosphoiylate the target proteins can reduce phosphorylation of the target proteins in 
response to a cellular signal. Similarly, a dominant negative transcription factor which binds 
to a promoter site in the control region of a gene but does not increase gene transcription can 
5 reduce the effect of a normal transcription factor by occupying promoter binding sites without 
increasing transcription. 

The end result of the expression of a dominant negative polypeptide in a cell is a 
- - reduction in £ ncti&i rv^a^^frotein^e : On* of Ordinary in *he art ran *r?sss ifo ' " ':^Jgtos^> ' ; 
potential for a dominant negative variant of a protein, and using standard mutagenesis 

10 techniques to create one or more dominant negative variant polypeptides. For example, given 
the teachings contained herein of a SAP polypeptides, one of ordinary skill in the art can 
modify the sequence of the SAP polypeptides by site-specific mutagenesis, scanning 
mutagenesis, partial gene deletion or truncation, and the like. See, e.g., U.S. Patent No. 
5,580,723 and Sambrook et al., Molecular Cloning: A Laboratory Manual, Second Edition, 

15 Cold Spring Harbor Laboratory Press, 1989. The skilled artisan then can test the population 
of mutagenized polypeptides for diminution in a selected activity (e.g., Smad6 binding, 
modulation of TGF-jJ signaling activity) and/or for retention of such an activity. Other 
similar methods for creating and testing dominant negative variants of a protein will be 
apparent to one of ordinary skill in the art. 

20 Dominant negative SAP proteins can include variants in which a portion of the Smad 

binding site has been mutated or deleted to reduce or eliminate SAP interaction with Smad6 
or Smad7. Other examples include SAP variants in which the ability to accept 
phosphorylation by MAP kinases is reduced. One of ordinary skill in the art can readily 
prepare and test SAP variants bearing mutations or deletions in various portions of the 

25 polypeptide. 

The invention also involves agents such as polypeptides which bind to SAP 
polypeptides and to complexes of SAP polypeptides and binding partners such as Smad6 and 
Smad7. Such binding agents can be used, for example, in screening assays to detect the 
presence or absence of SAP polypeptides and complexes of SAP polypeptides and their 
30 binding partners and in purification protocols to isolate SAP polypeptides and complexes of 
SAP polypeptides and their binding partners. Such agents also can be used to inhibit the 
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native activity of the SAP polypeptides or their binding partners, for example, by binding to 
such polypeptides, or their binding partners or both. 

The invention, therefore, embraces peptide binding agents which, for example, can be 

antibodies or fragments of antibodies having the ability to selectively bind to SAP 
5 polypeptides. Antibodies include polyclonal and monoclonal antibodies, prepared according 

to conventional methodology. 

Significantly, as is well-known in the art, only a small portion of an antibody 
J r module* thepa ..... •;"%fe^m^ fi 

general, Clark, W.R. (1986) The Experimental Foundations of Modern Immunology Wiley & 
10 Sons, Inc., New York; Roitt, I. (1 991) Essential Immunology. 7th Ed., Blackwell Scientific 

Publications, Oxford). The pFc f and Fc regions, for example, are effectors of the complement 

cascade but are not involved in antigen binding. An antibody from which the pFc' region has 

been enzymatically cleaved, or which has been produced without the pFc* region, designated 

an F(ab% fragment, retains both of the antigen binding sites of an intact antibody. Similarly, 
15 an antibody from which the Fc region has been enzymatically cleaved, or which has been 

produced without the Fc region, designated an Fab fragment, retains one of the antigen 

binding sites of an intact antibody molecule. Proceeding further, Fab fragments consist of a 

covalently bound antibody light chain and a portion of the antibody heavy chain denoted Fd. 

The Fd fragments are the major determinant of antibody specificity (a single Fd fragment 
20 may be associated with up to ten different light chains without altering antibody specificity) 

and Fd fragments retain epitope-binding ability in isolation. 

Within the antigen-binding portion of an antibody, as is well-known in the art, there 

are complementarity determining regions (CDRs), which directly interact with the epitope of 

the antigen, and framework regions (FRs), which maintain the tertiary structure of the 
25 paratope (see, in general, Clark, 1986; Roitt, 1991). In both the heavy chain Fd fragment and 

the light chain of IgG immunoglobulins, there are four framework regions (FR1 through FR4) 

separated respectively by three complementarity determining regions (CDR1 through CDR3). 

The CDRs, and in particular the CDR3 regions, and more particularly the heavy chain 

CDR3, are largely responsible for antibody specificity. 
30 It is now well-established in the art that the non-CDR regions of a mammalian 

antibody may be replaced with similar regions of conspecific or heterospecific antibodies 
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while retaining the epitopic specificity of the original antibody. This is most clearly 
manifested in the development and use of "humanized" antibodies in which non-human 
CDRs are covalently joined to human FR and/or Fc/pFc 1 regions to produce a functional 
antibody. See, e.g., U.S. patents 4,816,567, 5,225,539, 5,585,089, 5,693,762 and 5,859,205. 
5 Thus, for example, PCT International Publication Number WO 92/043$ 1 teaches the 

production and use of humanized murine RSV antibodies in which at least a portion of the 
murine FR regions have been replaced by FR regions of human origin. Such antibodies, 
:,i \#fe^*^->' s ''- '.. " " ij?duding fragr^rir^ of intact antibodies v^Jth'ant^^n^bindiiig^lity. rke cf^tri^fereJ u> as ' 
"chimeric" antibodies. 

10 Thus, as will be apparent to one of ordinary skill in the art, the present invention also 

provides for F(ab') 2s Fab, Fv and Fd fragments; chimeric antibodies in which the Fc and/or 
FR and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; chimeric F(ab') 2 fragment antibodies in which 
the FR and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by 

15 homologous human or non-human sequences; chimeric Fab fragment antibodies in which the 
FR and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; and chimeric Fd fragment antibodies in which 
the FR and/or CDR1 and/or CDR2 regions have been replaced by homologous human or non- 
human sequences. The present invention also includes so-called single chain antibodies. 

20 Thus, the invention involves polypeptides of numerous size and type that bind 

specifically to SAP polypeptides, and complexes of both SAP polypeptides and their binding 
partners. These polypeptides may be derived also from sources other than antibody 
technology. For example, such polypeptide binding agents can be provided by degenerate 
peptide libraries which can be readily prepared in solution, in immobilized form or as phage 

25 display libraries. Combinatorial libraries also can be synthesized of peptides containing one 
or more amino acids. Libraries further can be synthesized of peptoids and non-peptide 
synthetic moieties. 

Phage display can be particularly effective in identifying binding peptides useful 
according to the invention. Briefly, one prepares a phage library (using e.g. ml 3, fd, or 
30 lambda phage), displaying inserts from 4 to about 80 amino acid residues using conventional 
procedures. The inserts may represent, for example, a completely degenerate or biased array. 
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One then can select phage-bearing inserts which bind to the SAP polypeptide. This process 
can be repeated through several cycles of reselection of phage that bind to the SAP 
polypeptide. Repeated rounds lead to enrichment of phage bearing particular sequences. 
DNA sequence analysis can be conducted to identify the sequences of the expressed 
5 polypeptides. The minimal linear portion of the sequence that binds to the SAP polypeptide 
can be determined. One can repeat the procedure using a biased library containing inserts 
containing part or all of the minimal linear portion plus one or more additional degenerate 
i i ^riesidues upstream or downstra am thereof Yeast two-hybrid screening method? 'm? w /b<* u ' 
used to identify polypeptides that bind to the SAP polypeptides. Thr^, the SAP polypeptides 

10 of the invention, or a fragment thereof, can be used to screen peptide libraries, including 
phage display libraries, to identify and select peptide binding partners of the SAP 
polypeptides of the invention. Such molecules can be used, as described, for screening 
assays, for purification protocols, for interfering directly with the functioning of SAP and for 
other purposes that will be apparent to those of ordinary skill in the art. 

15 A SAP polypeptide, or a fragment thereof, also can be used to isolate their native 

binding partners, including, e.g., Smad6, Smad7 and complexes containing those proteins. 
Isolation of such binding partners may be performed according to well-known methods. For 
example, isolated SAP polypeptides can be attached to a substrate (e.g., chromatographic 
media, such as polystyrene beads, or a filter), and then a solution suspected of containing a 

20 Smad6, Smad7 or complex thereof may be applied to the substrate. If a SAP binding partner 
which can interact with SAP polypeptides is present in the solution, then it will bind to the 
substrate-bound SAP polypeptide. The SAP binding partner then may be isolated. Other 
proteins which are binding partners for SAP, such as other Smads, cyclin A, etc., may be 
isolated by similar methods without undue experimentation. 

25 It will also be recognized that the invention embraces the use of SAP cDNAs 

sequences in expression vectors, as well as to transfect host cells and cell lines, be these 
prokaryotic (e.g., E coli)> or eukaryotic (e.g., CHO cells, COS cells, yeast expression systems 
and recombinant baculovirus expression in insect cells). Especially useful are mammalian 
cells such as human, mouse, hamster, pig, goat, primate, etc. They may be of a wide variety 

30 of tissue types, and include primary cells and cell lines. Specific examples include 

keratinocytes, peripheral blood leukocytes, bone marrow stem cells and embryonic stem cells. 
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The expression vectors require that the pertinent sequence, i.e., those nucleic acids described 
supra, be operably linked to a promoter. 

The invention also includes transgenic non-human animals. As used herein, 
"transgenic non-human animals" includes non-human animals having one or more exogenous 
5 nucleic acid molecules incorporated in germ line cells and/or somatic cells. Thus the 

transgenic animal include "knockout" animals having a homozygous or heterozygous gene 
disruption by homologous recombination, animals having episomal or chromosomally 
!nc6rnrtrat04l^^ etrfr'^oc^out animals can prepared! '"% ^iMh^M^'^ * * 

recombination using embryonic stem cells as is well known in the art. The recombination can 

1 0 be facilitated by the cre/lox system or other recombinase systems known to one of ordinary 
skill in the art. In certain embodiments, the recombinase system itself is expressed 
conditionally, for example, in certain tissues or cell types, at certain embryonic or post- 
embryonic developmental stages, inducibly by the addition of a compound which increases or 
decreases expression, and the like. In general, the conditional expression vectors used in such 

1 5 systems use a variety of promoters which confer the desired gene expression pattern (e.g., 
temporal or spatial). Conditional promoters also can be operably linked to SAP nucleic acid 
molecules to increase expression of SAP in a regulated or conditional manner. Trans-Bating 
negative regulators of SAP activity or expression also can be operably linked to a conditional 
promoter as described above. Such frara-acting regulators include antisense SAP nucleic 

20 acids molecules, nucleic acid molecules which encode dominant negative SAP molecules, 
ribozyme molecules specific for SAP nucleic acids, and the like. The transgenic non-human 
animals are useful in experiments directed toward testing biochemical or physiological effects 
of diagnostics or therapeutics for conditions characterized by increased or decreased SAP 
expression. Other uses will be apparent to one of ordinary skill in the art. 

25 The invention also contemplates gene therapy. The procedure for performing ex vivo 

gene therapy is outlined in U.S. Patent 5,399,346 and in exhibits submitted in the file history 
of that patent, all of which are publicly available documents. In general, it involves 
introduction in vitro of a functional copy of a gene into a cell(s) of a subject which contains a 
defective copy of the gene, and returning the genetically engineered cell(s) to the subject. 

30 The functional copy of the gene is under operable control of regulatory elements which 
permit expression of the gene in the genetically engineered cell(s). Numerous transfection 
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and transduction techniques as well as appropriate expression vectors are well known to those 
of ordinary skill in the art, some of which are described in PCT application WO95/00654. In 
vivo gene therapy using vectors such as adenovirus, retroviruses, herpes virus, and targeted 
liposomes also is contemplated according to the invention. 
5 The invention further provides efficient methods of identifying pharmacological 

agents or lead compounds for agents active at the level of a SAP or SAP fragment 
modulatable cellular function. In particular, such functions include TGF-p superfamily signal 

- - - - transfetie?- cydin rejpla^n^d formation ^ ' O-^cniJj;;, %j 'W^**®^ " 

screening methods invoh ^ assaying for compounds which interfere with a SAP activity such 

10 as Smad binding, etc, although compounds which enhance SAP activity also can be assayed 
using the screening methods. Such methods are adaptable to automated, high throughput 
screening of compounds. The target therapeutic indications for pharmacological agents 
detected by the screening methods are limited only in that the target cellular function be 
subject to modulation by alteration of the formation of a complex comprising a SAP 

15 polypeptide or fragment thereof and one or more natural SAP intracellular binding targets, 
such as Smad6. Target indications include cellular processes modulated by TGF-p 
superfamily signal transduction following receptor-ligand binding. 

A wide variety of assays for pharmacological agents are provided, including, labeled 
in vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays, 

20 cell-based assays such as two- or three-hybrid screens, expression assays, etc. For example, 
three-hybrid screens are used to rapidly examine the effect of transfected nucleic acids on the 
intracellular binding of SAP or SAP fragments to specific intracellular targets. The 
transfected nucleic acids can encode, for example, combinatorial peptide libraries or antisense 
molecules. Convenient reagents for such assays, e.g., GAL4 fusion proteins, are known in 

25 the art. An exemplary cell-based assay involves transfecting a cell with a nucleic acid 
encoding a SAP polypeptide fused to a GAL4 DNA binding domain and a nucleic acid 
encoding a Smad domain which interacts with SAP fused to a transcription activation domain 
such as VP16. The cell also contains a reporter gene operably linked to a gene expression 
regulatory region, such as one or more GAL4 binding sites. Activation of reporter gene 

30 transcription occurs when the SAP and Smad fusion polypeptides bind such that the GAL4 
DNA binding domain and the VP 16 transcriptional activation domain are brought into 
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proximity to enable transcription of the reporter gene. Agents which modulate a SAP 
polypeptide mediated cell function are then detected through a change in the expression of 
reporter gene. Methods for determining changes in the expression of a reporter gene are 
known in the art. 

5 SAP fragments used in the methods, when not produced by a transfected nucleic acid 

are added to an assay mixture as an isolated polypeptide. SAP polypeptides preferably are 
produced recombinant^, although such polypeptides may be isolated from biological 

" Reeomb^itSv produced SA^j^ly^tiac^ ^Rcfcdc'chlui^., pMW'.ias compos* ■ ■ " '' ' '^m^^r 

fusion of a SAP protein with another polypeptide, e.g., a polypeptide capable of providing or 

1 0 enhancing protein-protein binding, sequence specific nucleic acid binding (such as GAL4), 
enhancing stability of the SAP polypeptide under assay conditions, or providing a detectable 
moiety, such as green fluorescent protein or Flag epitope as provided in the examples below. 

The assay mixture is comprised of a natural intracellular SAP binding target such as 
Smad6 or a fragment thereof capable of interacting with SAP. While natural SAP binding 

15 targets may be used, it is frequently preferred to use portions (e.g., peptides or nucleic acid 
fragments) or analogs (i.e., agents which mimic the SAP binding properties of the natural 
binding target for purposes of the assay) of the SAP binding target so long as the portion or 
analog provides binding affinity and avidity to the SAP fragment measurable in the assay. 
The assay mixture also comprises a candidate pharmacological agent Typically, a 

20 plurality of assay mixtures are run in parallel with different agent concentrations to obtain a 
different response to the various concentrations. Typically, one of these concentrations 
serves as a negative control, i.e., at zero concentration of agent or at a concentration of agent 
below the limits of assay detection. Candidate agents encompass numerous chemical classes, 
although typically they are organic compounds. Preferably, the candidate pharmacological 

25 agents are small organic compounds, i.e., those having a molecular weight of more than 50 
yet less than about 2500, preferably less than about 1000 and, more preferably, less than 
about 500. Candidate agents comprise functional chemical groups necessary for structural 
interactions with polypeptides and/or nucleic acids, and typically include at least an amine, 
carbonyl, hydroxyl or carboxyl group, preferably at least two of the functional chemical 

30 groups and more preferably at least three of the functional chemical groups. The candidate 
agents can comprise cyclic carbon or heterocyclic structure and/or aromatic or polyaromatic 
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structures substituted with one or more of the above-identified functional groups. Candidate 
agents also can be biomolecules such as peptides, saccharides, fatty acids, sterols, 
isoprenoids, purines, pyrimidines, derivatives or structural analogs of the above, or 
combinations thereof and the like. Where the agent is a nucleic acid, the agent typically is a 
5 DNA or RNA molecule, although modified nucleic acids as defined herein are also 
contemplated. 

Candidate agents are obtained from a wide variety of sources including libraries of 
"'synthetic t^rx'^r&l compounds . ^ercx3rnpl^;nmeiM^ Mis- 
directed synthesis of a wide variety of organic comp cards and biomolecules, including 

1 0 expression of randomized oligonucleotides, synthetic organic combinatorial libraries, phage 
display libraries of random peptides, and the like. Alternatively, libraries of natural 
compounds in the form of bacterial, fungal, plant and animal extracts are available or readily 
produced. Additionally, natural and synthetically produced libraries and compounds can be 
readily be modified through conventional chemical, physical, and biochemical means. 

15 Further, known pharmacological agents may be subjected to directed or random chemical 
modifications such as acylation, alkylation, esterification, amidification, etc. to produce 
structural analogs of the agents. 

A variety of other reagents also can be included in the mixture. These include 
reagents such as salts, buffers, neutral proteins (e.g., albumin), detergents, etc. which may be 

20 used to facilitate optimal protein-protein and/or protein-nucleic acid binding. Such a reagent 
may also reduce non-specific or background interactions of the reaction components. Other 
reagents that improve the efficiency of the assay such as protease, inhibitors, nuclease 
inhibitors, antimicrobial agents, and the like may also be used. 

The mixture of the foregoing assay materials is incubated under conditions whereby, 

25 but for the presence of the candidate pharmacological agent, the SAP polypeptide specifically 
binds the cellular binding target, a portion thereof or analog thereof. The order of addition of 
components, incubation temperature, time of incubation, and other perimeters of the assay 
may be readily determined. Such experimentation merely involves optimization of the assay 
parameters, not the fundamental composition of the assay. Incubation temperatures typically 

30 are between 4°C and 40°C. Incubation times preferably are minimized to facilitate rapid, 
high throughput screening, and typically are between 0.1 and 10 hours. 
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After incubation, the presence or absence of specific binding between the SAP 
polypeptide and one or more binding targets is detected by any convenient method available 
to the user. For cell free binding type assays, a separation step is often used to separate bound 
from unbound components. The separation step may be accomplished in a variety of ways. 

5 Conveniently, at least one of the components is immobilized on a solid substrate, from which 
the unbound components may be easily separated. The solid substrate can be made of a wide 
variety of materials and in a wide variety of shapes, e.g., microtiter plate, microbead, dipstick, 
; ^rr^sin^p^cle: etc ^^s&brf^ifs preferably is chases to n>sSl^*r*itf^^ 

primarily to minimize background binding, as well as for ease of separation and cost. 

10 Separation may be effected for example, by removing a bead or dipstick from a 

reservoir, emptying or diluting a reservoir such as a microtiter plate well, rinsing a bead, 
particle, chromatographic column or filter with a wash solution or solvent. The separation 
step preferably includes multiple rinses or washes. For example, when the solid substrate is a 
microtiter plate, the wells may be washed several times with a washing solution, which 

15 typically includes those components of the incubation mixture that do not participate in 
specific bindings such as salts, buffer, detergent, non-specific protein, etc. Where the solid 
substrate is a magnetic bead, the beads may be washed one or more times with a washing 
solution and isolated using a magnet 

Detection may be effected in any convenient way for cell-based assays such as two- or 

20 three-hybrid screens. The transcript resulting from a reporter gene transcription assay of SAP 
polypeptide interacting with a target molecule typically encodes a directly or indirectly 
detectable product, e.g., p-galactosidase activity, luciferase activity, and the like. For cell 
free binding assays, one of the components usually comprises, or is coupled to, a detectable 
label A wide variety of labels can be used, such as those that provide direct detection (e.g., 

25 radioactivity, luminescence, optical or electron density, etc), or indirect detection (e.g., 

epitope tag such as the FLAG epitope, en2yme tag such as horseradish peroxidase, etc.). The 
label may be bound to a SAP binding partner, or incorporated into the structure of the binding 
partner. 

A variety of methods may be used to detect the label, depending on the nature of the 
30 label and other assay components. For example, the label may be detected while bound to the 
solid substrate or subsequent to separation from the solid substrate. Labels may be directly 
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detected through optical or electron density, radioactive emissions, nonradiative energy 
transfers, etc. or indirectly detected with antibody conjugates, strepavidin-biotin conjugates, 
etc. Methods for detecting the labels are well known in the art. 

The invention provides SAP-specific binding agents, methods of identifying and 
5 making such agents, and their use in diagnosis, therapy and pharmaceutical development. For 
example, SAP-specific pharmacological agents are useful in a variety of diagnostic and 
therapeutic applications, especially where disease or disease prognosis is associated with 
i^rcmei&itf^^ 

formation, (-GF-p superfamily signaling, cyclin regulation of the cell cycle, etc. : Jovel SAP- 
10 specific binding agents include SAP-specific antibodies and other natural intracellular 

binding agents identified with assays such as two hybrid screens, and non-natural intracellular 

binding agents identified in screens of chemical libraries and the like. 

In general, the specificity of SAP binding to a binding agent is shown by binding 

equilibrium constants. Targets which are capable of selectively binding a SAP polypeptide 
15 preferably have binding equilibrium constants of at least about 10 7 Nf 1 , more preferably at 

least about 10 8 M" 1 , and most preferably at least about 10 9 M* 1 . The wide variety of cell based 

and cell free assays may be used to demonstrate SAP-specific binding. Cell based assays 

include one, two and three hybrid screens, assays in which SAP-mediated transcription is 

inhibited or increased, etc. Cell free assays include SAP-protein binding assays, 
20 immunoassays, etc. Other assays useful for screening agents which bind SAP polypeptides 

include fluorescence resonance energy transfer (FRET), and electrophoretic mobility shift 

analysis (EMSA). 

Various techniques may be employed for introducing nucleic acids of the invention 
into cells, depending on whether the nucleic acids are introduced in vitro or in vivo in a host. 

25 Such techniques include transfection of nucleic acid-CaP0 4 precipitates, transfection of 

nucleic acids associated with DEAE, transfection with a retrovirus including the nucleic acid 
of interest, liposome mediated transfection, and the like. For certain uses, it is preferred to 
target the nucleic acid to particular cells. In such instances, a vehicle used for delivering a 
nucleic acid of the invention into a cell (e.g., a retrovirus, or other virus; a liposome) can have 

30 a targeting molecule attached thereto. For example, a molecule such as an antibody specific 
for a surface membrane protein on the target cell or a ligand for a receptor on the target cell 
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can be bound to or incorporated within the nucleic acid delivery vehicle. For example, where 
liposomes are employed to deliver the nucleic acids of the invention, proteins which bind to a 
surface membrane protein associated with endocytosis may be incorporated into the liposome 
formulation for targeting and/or to facilitate uptake. Such proteins include capsid proteins or 
5 fragments thereof tropic for a particular cell type, antibodies for proteins which undergo 
internalization in cycling, proteins that target intracellular localization and enhance 
intracellular half life, and the like. Polymeric delivery systems also have been used 
, . sn^cfscf L*I!y ■ tc ^^JivePynucJeic $rids if>*o c^lfTcs i£ 1 .^vwk by thcr^ rJk \ f *k£ va the* ' ' 
systems even permit r^l delivery of nucleic acids. 

10 When administered, the therapeutic compositions of the present invention are 

administered in pharmaceutical^ acceptable preparations. Such preparations may routinely 
contain pharmaceutical^ acceptable concentrations of salt, buffering agents, preservatives, 
compatible carriers, supplementary immune potentiating agents such as adjuvants and 
cytokines and optionally other therapeutic agents. 

15 The therapeutics of the invention can be administered by any conventional route, 

including injection or by gradual infusion over time. The administration may, for example, 
be oral, intravenous, intraperitoneal, intramuscular, intracavity, subcutaneous, or transdermal. 
When antibodies are used therapeutically, a preferred route of administration is by pulmonary 
aerosol. Techniques for preparing aerosol delivery systems containing antibodies are well 

20 known to those of skill in the art. Generally, such systems should utilize components which 
will not significantly impair the biological properties of the antibodies, such as the paratope 
binding capacity (see, for example, Sciarra and Cutie, "Aerosols," in Remington's 
Pharmaceutical Sciences. 18th edition, 1990, pp 1694-1712; incorporated by reference). 
Those of skill in the art can readily determine the various parameters and conditions for 

25 producing antibody aerosols without resort to undue experimentation. When using antisense 
preparations of the invention, slow intravenous administration is preferred. 

The compositions of the invention are administered in effective amounts. An 
"effective amount" is that amount of a composition that alone, or together with further doses, 
produces the desired response, e.g. alters favorably the signal transduction resulting from 

30 binding of a TGF-p superfamily ligand to specific receptors. In the case of treating a 

particular disease, such as cancer, the desired response is inhibiting the progression of the 
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disease. This may involve only slowing the progression of the disease temporarily, although 
more preferably, it involves halting the progression of the disease permanently. This can be 
monitored by routine methods or can be monitored according to diagnostic methods of the 
invention discussed herein. 
5 Such amounts will depend, of course, on the particular condition being treated, the 

severity of the condition, the individual patient parameters including age, physical condition, 
size and weight, the duration of the treatment, the nature of concurrent therapy (if any), the 
■" . , ^Ccifk .route of ^idr*>>^^ tffe ■^GwIoSg* and ^xperdse \'^Mm^^ lJ 

health practitioner. These factors are WeU known to those of ordinary skill in the art and can 

1 0 be addressed with no more than routine experimentation. It is generally preferred that a 
maximum dose of the individual components or combinations thereof be used, that is, the 
highest safe dose according to sound medical judgment. It will be understood by those of 
ordinary skill in the art, however, that a patient may insist upon a lower dose or tolerable dose 
for medical reasons, psychological reasons or for virtually any other reasons. 

15 The pharmaceutical compositions used in the foregoing methods preferably are sterile 

and contain an effective amount of SAP or nucleic acid encoding SAP for producing the 
desired response in a unit of weight or volume suitable for administration to a patient. The 
response can, for example, be measured by determining the signal transduction enhanced or 
inhibited by the SAP composition via a reporter system as described herein, by measuring 

20 downstream effects such as gene expression, or by measuring the physiological effects of the 
SAP composition, such as regression of a tumor or decrease of disease symptoms. Likewise, 
the effects of antisense SAP molecules can be readily determined by measuring expression of 
the individual genes in cells to which an antisense composition is added. Other assays will be 
known to one of ordinary skill in the art and can be employed for measuring the level of the 

25 response. 

The doses of SAP polypeptide or nucleic acid administered to a subject can be chosen 
in accordance with different parameters, in particular in accordance with the mode of 
administration used and the state of the subject. Other factors include the desired period of 
treatment. In the event that a response in a subject is insufficient at the initial doses applied, 
30 higher doses (or effectively higher doses by a different, more localized delivery route) may be 
employed to the extent that patient tolerance permits. 
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In general, doses of SAP are formulated and administered in doses between 1 ng and 
1 mg, and preferably between 10 ng and 100 ng, according to any standard procedure in the 
art. Where nucleic acids encoding SAP of variants thereof are employed, doses of between 
1 ng and 0.1 mg generally will be formulated and administered according to standard 
procedures. Other protocols for the administration of SAP compositions will be known to one 
of ordinary skill in the art, in which the dose amount, schedule of injections, sites of 
injections, mode of administration (e.g., intra-tumoral) and the like vary from the foregoing. 
Administration of *K ^ compositions tn mamsnah oth 

purposes or veterinary therapeutic puiposes, is carried oul -nder substantially the same 
conditions as described above. 

When administered, the pharmaceutical preparations of the invention are applied in 
pharmaceutically-acceptable amounts and in pharmaceutically-acceptable compositions. The 
term "pharmaceutically acceptable" means a non-toxic material that does not interfere with 
the effectiveness of the biological activity of the active ingredients. Such preparations may 
routinely contain salts, buffering agents, preservatives, compatible carriers, and optionally 
other therapeutic agents. When used in medicine, the salts should be pharmaceutically 
acceptable, but non-pharmaceutically acceptable salts may conveniently be used to prepare 
pharmaceutically-acceptable salts thereof and are not excluded from the scope of the 
invention. Such pharmacologically and pharmaceutically-acceptable salts include, but are not 
limited to, those prepared from the following acids: hydrochloric, hydrobromic, sulfuric, 
nitric, phosphoric, maleic, acetic, salicylic, citric, formic, malonic, succinic, and the like. 
Also, pharmaceutically-acceptable salts can be prepared as alkaline metal or alkaline earth 
salts, such as sodium, potassium or calcium salts. 

S APs may be combined, if desired, with a pharmaceutically-acceptable carrier. The 
term "pharmaceutically-acceptable carrier" as used herein means one or more compatible 
solid or liquid fillers, diluents or encapsulating substances which are suitable for 
administration into a human. The term "carrier" denotes an organic or inorganic ingredient, 
natural or synthetic, with which the active ingredient is combined to facilitate the application. 
The components of the pharmaceutical compositions also are capable of being co-mingled 
with the molecules of the present invention, and with each other, in a manner such that there 
is no interaction which would substantially impair the desired pharmaceutical efficacy. 
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The pharmaceutical compositions may contain suitable buffering agents, including: 
acetic acid in a salt; citric acid in a salt; boric acid in a salt; and phosphoric acid in a salt. 

The pharmaceutical compositions also may contain, optionally, suitable preservatives, 
such as: benzalkonium chloride; chlorobutanol; parabens and thimerosal, 
5 The pharmaceutical compositions may conveniently be presented in unit dosage form 

and may be prepared by any of the methods well-known in the art of pharmacy. All methods 
include the step of bringing the active agent into association with a carrier which constitutes 
**}»$M&^n& or more accessory ragradkats. In general, the c?n*pofit:on& " 
and intimately bringing the active compound into association wif* a liquid carrier, a finely 

1 0 divided solid carrier, or both, and then, if necessary, shaping the product 

Compositions suitable for oral administration may be presented as discrete units, such 
as capsules, tablets, lozenges, each containing a predetermined amount of the active 
compound. Other compositions include suspensions in aqueous liquids or non-aqueous 
liquids such as a syrup, elixir or an emulsion. 

15 Compositions suitable for parenteral administration conveniently comprise a sterile 

aqueous or non-aqueous preparation of SAP polypeptides or nucleic acids, which is 
preferably isotonic with the blood of the recipient. This preparation may be formulated 
according to known methods using suitable dispersing or wetting agents and suspending 
agents. The sterile injectable preparation also may be a sterile injectable solution or 

20 suspension in a non-toxic parenterally-acceptable diluent or solvent, for example, as a 
solution in 1,3-butane dioL Among the acceptable vehicles and solvents that may be 
employed are water, Ringer's solution, and isotonic sodium chloride solution. In addition, 
sterile, fixed oils are conventionally employed as a solvent or suspending medium. For this 
purpose any bland fixed oil may be employed including synthetic mono-or di-glycerides. In 

25 addition, fatty acids such as oleic acid may be used in the preparation of injectables. Carrier 
formulation suitable for oral, subcutaneous, intravenous, intramuscular, etc. administrations 
can be found in Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, PA. 

In another aspect of the invention, SAP polypeptides or nucleic acid are used in the 
manufacture of a medicament for modulating a TGF-p superfamily ligand response. The 

30 medicament can be placed in a vial and be incorporated into a kit to be used for increasing a 
subject's response to one or more TGF-P family members. In certain embodiments, other 
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medicaments which modulate the same responses or which favorably affect the SAP 
compositions can also be included in the same kit. The kits can include instructions or other 
printed material on how to administer the SAP compositions and any other components of the 
kit. 

5 

Examples 

Materials and Methods 

. DNA constj^tsr^'' - ■ ;v :' * -4^,^.,^^^ . ! 

pEC->mad6S, pEG-Smad6SN, pEG-SmadeSC, pEG-Smad7, pEG-Smad: ; and pEG- 

10 Smad7C were made by PCR and inserted into pEG202 (Golemis et al., Analysis of protein 
interactions, p. 20.1.1-20.1.40 In F. M. Ausubel, R. Brent, R.E. Kingston, D.D. Moore, J.G. 
Seidman, J. A. Smith and K, Struhl (eds.), Current Protocols in Molecular Biology, vol. 3, 
John Wiley & Sons, Inc., 1999). pEG-Smad2 and pEG-Smad4 was obtained from Dr. R. 
Derynk(Wuetal.,M>/. Ceil Biol 17:2521-2528., 1997). 6xMyc-Smadl, 6xMyc-Smad2 

15 and 6xMyc-Smad3 were provided by Dr. K. Miyazono (Nishihara et al., Genes. Cells. 3:613- 
623, 1998). 6xMyc-Smad4, 6xMyc-Smad6S, 6xMyc-Smad6L and 6xMyc-Smad7 were 
constructed using 6xMyc-pCDNA3 (Nishihara et al., 1998). Flag-AMSH, Flag- 
AMSH(DBS2), Flag-AMSH(DBS3), Flag-AMSH(DC2), Flag-AMSH(DJS), Flag- 
AMSH(DNL) were described previously (Tanaka et al., J. Biol Chem. 274:19129-19135, 

20 1999). Flag-AMSH(A1 02-233), Flag-AMSH(1 06-424) and Flag-AMSH(234-424) were 
generated by PCR and subcloned into pCMV2-Flag vector (Sigma). 

Yeast two-hybrid screening 

Several constructs of LexA-Smad fusions in the pEG202 vector and human fetal brain 

25 library in the pJG4-5 vector were used. Library screens were carried out using Leu2 and 
P-galactosidase reporters (pSHl 8-34) within the yeast strain, EGY48. In brief, EGY48 cells 
were transformed with pEG-Smad6SN, pSH 18-34 and library and plated in galactose- 
containing medium without histidine. Positive colonies were picked 3-5 days after plating 
(Golemis et al., 1999). Subsequently, positive colonies were tested again and confirmed as 

30 real positive clones. 
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DNA sequence analysis 

The nucleotide sequences were determined for both strands with an ABB 10 DNA 
sequencer. 



5 Immunoprecipitation and Western blotting 

Combinations of Smads and AMSH or its mutants in the presence or absence of 
ALK5ca or ALK6ca were transfected in COS7 cells at 12 x 10 6 cells/10 cm-dish usino 

of lysis buffer (20 mM Tris [pH/,4], 150 mM NaCl, 10% glycerol, 1% Triton X-100, 1 mM 
10 PMSF and 100 units/ml Trasylol). The cell Iysates were precleared with protein G-Sepharose 
beads (Pharmacia) and incubated with Flag M5 antibody (Sigma) for 2 h at 4°C. 
Subsequently, protein G-Sepharose beads were added to the reaction mixture and samples 
were incubated for 30 min at 4°C. After washing the immunoprecipitates with lysis buffer 
three times, immunoprecipitates and aliquots of cell Iysates before immunoprecipitation were 
15 separated by SDS-polyacrylamide gel electrophoresis and transferred to a Hybond-C extra 
membrane (Amersham). The membrane was then probed with Flag M5 or Myc (9E1 0 
monoclonal antibody; Santa Cruz) antibody. Primary antibodies were detected with a 
horseradish peroxidase-conjugated goat anti-mouse antibody (Amersham) and a 
chemiluminescent substrate. 

20 

[ 32 PJOrthophosphate labeling of cells, tryptic phosphopeptide mapping and two- 
dimensional phosphoamino acid analysis 

COS7 cells were labeled in phosphate-free medium for 3 h. Subsequently, 1 mCi/ml 
[ 32 P]orthophosphate was added in the culture medium. After 40 min, the cells were lysed, 

25 immunoprecipitated with anti-FlagM5 antibody, separated by SDS-polyacrylamide gel 

electrophoresis and transferred to a Hybond-C extra membrane. For tryptic phosphopeptide 
mapping, AMSH bands were localized by exposure on a FujiX Bio-Imager (Fuji), excised 
from the filter and digested in situ with trypsin (modified sequencing grade; Promega). Two- 
dimensional phosphopeptide mapping was done using the Hunter thin-layer electrophoresis 

30 apparatus (HTLE-7000; CBS Scientific), essentially as described by Boyle et al. (Methods 
Enzymol 201 : 1 10-149, 1991). First dimension electrophoresis was performed in pH 1 .9 
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buffer (formic acid:glacial acetic acid: water; 44: 156:1 800) for 23 min at 2000 V, and second 
dimension ascending thin-layer chromatography in isobutyric acid buffer (isobutyric acid:n- 
butanol:pyridine:glacial acetic acid:water; 1250:38:96:58:558). After exposure, 
phosphopeptides were eluted from the plates in the pH 1.9 buffer and lyophilized. The 
5 fractions were then subjected to two-dimensional phosphoamino acid analysis. 

Example 1: Isolation of Smad Associating Proteins 

To explore £^ ^voidns that interact ■-. "' 

with Smad6 and Smad7 have been i&biated using the yeast two hybrid system. 

10 Using the N-terminal half of Smad6 MH2 domain (pEG-Smad6SN) (Fig. 1) as a bait 

to screen a human fetal brain library (4 x 10 6 colonies), 12 kinds of distinct positive cDNA 
clones encoding 7 known and 5 unknown proteins were obtained; the latter molecules were 
termed Smad6 associating proteins (SAPs) 1 through 5 (Fig. 2). Subsequently, the interaction 
of identified molecules with Smad2, Smad4, Smad6 or Smad7 was investigated using the 

15 yeast two hybrid system. As seen in Fig. 2, all clones except for dodecenoyl-CoA could bind 
to either Smad6S or Smad7. However, no or very weak interactions between Smad2 or 
Smad4 and the identified molecules were seen. Among the novel cDNAs, SAPs 1-3 were 
further analyzed, of which multiple positive clones were isolated (Fig. 2). The entire coding 
sequences for SAP1 (Fig. 3 and SEQ ID NO: 1) and SAP2 (Fig. 4 and SEQ ID NO:3) were 

20 obtained as expressed sequence tags (ESTs), but no ESTs encoding SAP3 (SEQ ID NO:5) 
could be found. 

SAP1 was previously isolated and termed "associated molecule with the SH 3 domain 
of STAM" (AMSH) (Tanaka et al., 1999). Thus, SAP1 is called AMSH in the following, 
AMSH was originally found to interact with the signal transducing adaptor molecule 
25 (STAM). AMSH has three unique motifs in its structure, i.e., a nuclear translocation! signal, 
an SH3 binding site (SXXP; SEQ ID NO:7) and a JAB1 subdomain homologous region 
(JSH) (Fig. 5). 

In order to investigate whether AMSH interacts with Smads in vivo, COS7 cells were 
transfected with Flag-tagged AMSH and different Myc-tagged-Smads (Smadl, Smad4, 
30 Smad6S, Smad6L and Smad7) in the absence and presence of constitutively active ALK6 
(ALK6ca). Samples were then subjected to immunoprecipitation with Flag antibodies and 
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blotting with Myc antibodies. AMSH interacted with Smad4, Smad6S and Smad6L weakly 
in the absence of constitutively activate ALK6 in COS7 cells. Interestingly, upon 
transfection with constitutively active ALK6 the interaction of AMSH with Smad4, Smad6S 
and Smad6L increased. Smad7 constitutively bound to AMSH. However, Smadl did not 
5 associate with AMSH. 

In a similar experiment, ALK5-dependent interaction of AMSH was explored. COS7 
cells were transfected with Flag-tagged AMSH and different Myc-tagged Smadr (Smad2 
/Sm^^^Stn^^^^^SjlSmsdeL ard Stt^AT) 'intfrfi ^bsmdesind pr^xixczof^^ ; .h'eh* N 
activated ALK5. Interestingly, activated ALK5 (ALKSca) promoted the interaction of 

10 AMSH with Smad2 and in particular Smad3, while Smad4, Smad6S, Smad6L and Smad7 
interact with AMSH independent of ALK5. 

It is known that Smad6 inhibits the BMP pathway more efficiently than the TGF-p 
pathway (Hata et al., Genes DeveL 12:186-197, 1998). Since BMP receptors possess an 
intrinsic serine/threonine kinase, it was examined whether or not AMSH was phosphorylated, 

15 and it was determined that AMSH was phosphorylated by activated ALK6. However, AMSH 
might not be a direct substrate for the serine/threonine kinase of ALK6 because the 
phosphorylation of AMSH was detected 4 h after the treatment with OP-1 in COS7 cells 
which were reconstituted with AMSH, ALK6 and BMPR-II. 

TGF-P family signaling has been known to be mediated in part through MAP kinase 

20 pathways (Atfi et al., J. Biol Chem. 272:1429-1432, 1997; Sano et al., J. Biol Chem. 

274:8949-8957, 1999). Therefore, the effect of MAP kinase inhibitors was investigated on 
the phosphorylation of AMSH. Cells were incubated with inhibitors 3 h before the addition 
of [ 32 P]orthophosphate, SB203580, a p38 inhibitor, inhibited the phosphorylation of AMSH 
in a dose-dependent manner, whereas PD98059, an ERK inhibitor, had no effect. The effect 

25 of the third MAP kinase pathway, JNK, on the phosphorylation of AMSH was not 

investigated because no commercial inhibitor is available. The MAP kinases that mainly 
contribute to the phosphorylation of AMSH are confirmed using dominant negative JNK and 
p38 in phosphorylation experiments as described above. 

Often, the phosphorylation status of a protein correlates within biological activity. 

30 Thus, tryptic phosphopeptide mapping of AMSH stimulated with ALK6ca we preformed was 
performed. Four major phosphopeptides were induced by ALK6ca. Phosphoamino acid 
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analysis revealed that only serine residues were phosphorylated. The exact position of the 
phosphorylated serine residues in the phosphopeptides is identified by, e.g., amino acid 
sequencing of the phosphopeptides. 

Deletion mutants of AMSH were made to find important regions for biological 
5 activity (Fig. 5). The in vivo interaction with Smad6L was investigated for two of the 
mutants which were found to associate with Smad6L in the presence of active ALK6. In 
particular, AMSH(DC2) which lacks the C-terminal half of AMSH int^cted with Smad6L 
• ^rited moire eSxcier*!;/ wi&Sh^j&f: ii- ilie piozztlCc of ^ALK*j&;, ? Moti^'hc c^r " 
experiment using all mutants depicted in Fig. 5 is performed to identify portions r A AMSH 
10 which interact with Smad6 and Smad7. 

The phosphorylation of AMSH mutants by ALK6ca was tested as well. 
AMSH(DBS2) was highly phosphorylated. On the other hand, the phosphorylation of 
AMSH(DC2) was very weak. These observations suggest that the N-terminal part of AMSH 
is involved in the interaction with Smad6L, whereas phosphorylation sites are localized in the 
15 C-terminal part. The phosphorylation of other mutants depicted in Fig. 5 by ALK6ca also is 
performed to confirm results and further localize phosphorylation sites. 

The effect of AMSH and mutants thereof are tested in a luciferase assay for TGF-p- 
family-dependent activity (e.g., Jonk et al., J. Biol Chem. 273:21 145-21 152, 1998) as well as 
for their effect in aXenopus animal cap assay (e.g., Nakao et al., Nature 389:631-635, 1997). 
20 A Xenopus homologue of S AP2 was recently identified and termed XDRP 1 . 

(GenBank accession number AB030502; Funakoshi et al., EMBOJ. 18:5009-5018, 1999). It 
was reported that XDRP 1 binds to cyclin A and inhibits its degradation. Since cyclin A is 
involved in the cell cycle, it is possible that Smad6L regulates the cell cycle through the 
interaction with S AP2. Alternatively, SAP2 may regulate the degradation of Smad6L. 
25 Other aspects of the invention will be clear to the skilled artisan and need not be 

repeated here. Each reference cited herein is incorporated by reference in its entirety. 

The terms and expressions which have been employed are used as terms of description 
and not of limitation, and there is no intention in the use of such terms and expressions of 
excluding any equivalents of the features shown and described or portions thereof, it being 
30 recognized that various modifications are possible within the scope of the invention. 
We claim: 
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CLA1MS 

1 . An isolated nucleic acid molecule selected from the group consisting of 

(a) nucleic acid molecules which hybridize under stringent conditions to a molecule 
consisting of the nucleic acid sequence set forth in SEQ ID NO:3 or SEQ ID NO:5 and which 

5 code for a polypeptide which binds Smad6, 

(b) nucleic acid molecules that differ from the nucleic acid molecules of (a) in codon 
sequence due to the degeneracy of the genetic code, and 

:V ^-^v(jc) cor^??l ^.f (2ypnd{h v , ■ • ■■^^■-^^ ■ " 

10 2. The isolated nucleic acid molecule of claim 1 > wherein the isolated nucleic acid 
molecule consists of SEQ ID NO:3, 

3. The isolated nucleic acid molecule of claim 1 , wherein the isolated nucleic acid 
molecule consists of SEQ ID NO:5. 

15 

4. An isolated nucleic acid molecule selected from the group consisting of (a) a unique 
fragment of nucleotides 1-2399 of SEQ ID NO:3 between 12 and 2398 nucleotides in length, 
(b) a unique fragment of nucleotides 1-855 of SEQ ID NO: 5 between 12 and 854 nucleotides 
in length, (c) complements of "(a)" and (d) complements of "(b)", provided that the nucleic 

20 acid molecule excludes sequences consisting of GenBank accession numbers AI2191 12 and 
N33797, AB030502. 

5. The isolated nucleic acid molecule of claim 4, wherein the isolated nucleic acid 
molecule consists of at least 22, 25, 30, 40, 50, 75 or 100 contiguous nucleotides. 

25 

6. The isolated nucleic acid molecule of claim 4, wherein the isolated nucleic acid 
molecule consists of between 20 and 32 contiguous nucleotides. 

7. An expression vector comprising the isolated nucleic acid molecule of any of claims 
30 1 , 2, 3 or 4 operably linked to a promoter. 
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8. A host cell transformed or transfected with the expression vector of claim 7. 



9. A method for producing a polypeptide comprising culturing the host cell of claim 8 
under conditions which permit the exp[ression of polypeptide. 

5 

1 0. The method of claim 9, further comprising isolating the polypeptide. 

10 12, An isolated polypeptide encoded by the isolated nucleic acid molecule of any of 
claims 1,2 or 3. 

1 3 . The isolated polypeptide of claim 1 2, wherein the isolated polypeptide is selected 
from the group consisting of molecules comprising the amino acid sequence of SEQ ID 

15 NO:4, SEQ ID NO:6, a fragment or functional variant of SEQ ID NO:4, and a fragment or 
functional variant of SEQ ID NO:6. 

14. An isolated polypeptide comprising a fragment or functional variant of SEQ ID NO:2. 

20 15. The isolated polypeptide of claim 14, wherein the isolated polypeptide consists of a 
fragment of SEQ ID NO:2 selected from the group consisting of amino acids 1-101+234-424, 
106-424 and 234-424. 

16. An isolated polypeptide which binds selectively a polypeptide encoded by the isolated 
25 nucleic acid molecule of any of claims 1 , 2 or 3, provided that the isolated polypeptide is not 

a Smad, STAM or cyclin polypeptide. 

1 7. The isolated polypeptide of claim 1 6, wherein the isolated polypeptide binds to an 
epitope defined by a polypeptide consisting of the sequence of SEQ ID NOs:2, 4 or 6. 

30 

18. The isolated polypeptide of claim 16, wherein the isolated polypeptide is an antibody 
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fragment selected from the group consisting of a Fab fragment, a F(ab) 2 fragment or a 
fragment including a CDR3 region selective for a SAP polypeptide, 

19. The isolated polypeptide of claim 16, wherein the isolated polypeptide is a 
5 monoclonal antibody, a humanized antibody or a chimeric antibody. 

20. An isolated complex of polypeptides comprising: 

■] r-wS- polypeptide as okimec rtervr- 12bouridto a j:>cl>7?^ptic?e scfcjr j ; 01 the groins, 
consisting of Smad6, Smad7 and fragments thereof. 

10 

21 . A method for modulating TGF-p superfamily signal transduction in a mammalian 
cell, comprising 

contacting the mammalian cell with an amount of an agent which increases the 
amount of a Smad associating protein selected from the group consisting of SAP1/AMSH 
1 5 (SEQ ID NO:2), SAP2 (SEQ ID NO:4), SAP3 (SEQ ID NO:6), Hsp40 homolog (U40992), 
Uba80 (X63237), Tax-1 binding protein (U33822), rabaptin-5 (NM_004703), and 26S 
proteinase S5a (U51007) or a fragment thereof in the cell effective to reduce TGF-P 
superfamily signal transduction in the mammalian cell. 

20 22. The method of claim 21, wherein the agent is a nucleic acid molecule. 

23. A method for regulating the cell cycle in a mammalian cell, comprising 
contacting the mammalian cell with an amount of an agent which increases the 

amount of SAP2 (SEQ ID NO:4), or a fragment thereof, in the cell effective to bind a cyclin 
25 and regulate the cell cycle in the mammalian cell. 

24. A method for identifying lead compounds for a pharmacological agent, comprising 
forming a mixture comprising a Smad6 or Smad7 polypeptide, a SAP polypeptide, 

and a candidate pharmacological agent, 
30 incubating the mixture under conditions which, in the absence of the candidate 

pharmacological agent, permit a first amount of specific binding of the SAP polypeptide by 



WO 01/21794 



PCT/US00/25790 



-45- 

the Smad6 or Smad7 polypeptide, and 

detecting a test amount of the specific binding of the SAP polypeptide by the Smad6 
or Smad7 polypeptide, wherein reduction of the test amount of specific binding relative to the 
first amount of specific binding indicates that the candidate pharmacological agent is a lead 
compound for a pharmacological agent which disrupts the Smad6-SAP or Smad7-S AP 
binding, and wherein increase of the test amount of specific binding relative to the first 
amount of specific binding indicates that the candidate pharmacological agent is a lead 
.■■coiapsnrd fife* # pharmacological ngrett ^" 'ch enhance*' the Smsd6~SAP c*r-Sr:^-7-S A? ■ 
binding. 

25. The method of claim 24, wherein the SAP polypeptide is selected from the group 
consisting of SAP1/AMSH, SAP2, SAP3 and fragments thereof. 

26. A method for identifying lead compounds for a pharmacological agent, comprising 
forming a mixture comprising an ALK kinase, a Smad polypeptide, a SAP 

polypeptide, and a candidate pharmacological agent, 

incubating the mixture under conditions which, in the absence of the candidate 
pharmacological agent, permit a first amount of specific binding of the SAP polypeptide by 
the Smad polypeptide, and 

detecting a test amount of the specific binding of the SAP polypeptide by the Smad 
polypeptide, wherein reduction of the test amount of specific binding relative to the first 
amount of specific binding indicates that the candidate pharmacological agent is a lead 
compound for a pharmacological agent which disrupts the Smad-SAP binding, and wherein 
increase of the test amount of specific binding relative to the first amount of specific binding 
indicates that the candidate pharmacological agent is a lead compound for a pharmacological 
agent which enhances the Smad-SAP binding. 

27. The method of claim 26, wherein the SAP polypeptide is selected from the group 
consisting of SAP1/AMSH, SAP2, SAP3 and fragments thereof. 

28. The method of claim 26, wherein the Smad polypeptide is selected from the group 
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consisting of Smad2, Smad3, Smad4, Smad6, Smad7 and fragments thereof. 

29. The method of claim 26, wherein the ALK kinase is selected from the group 
consisting of ALK5, constitutively activated ALK5, ALK6, constitutively activated ALK6 
and fragments thereof having kinase activity. 
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SEQUENCE LISTING 

<110> Ludwig Institute for Cancer Research 

<120> SMAD ASSOCIATING POLYPEPTIDES 

10 <130> L0461/7069WO 

O b0> ns 60/154, 840 

<160> 16 

20 <170> Patent In version 3.0 

<210> 1 

25 <211> 2003 

<212> DNA 

<213> Homo sapiens 

30 

<220> 

<221> CDS 

<222> (63} . . (1334) 



35 



<400> 1 

40 gaattcggca cgaggtttcc ggaacctccg ggtgtcatcc gcggggaaag aacttggtcc 60 

tg atg tct gac cat gga gat gtg age etc ccg ccc gaa gac egg gtg 107 
Met Ser Asp His Gly Asp Val Ser Leu Pro Pro Glu Asp Arg Val 
15 10 15 

45 

agg get etc tec cag ctg ggt agt gcg gta gag gtg aat gaa gac att 155 
Arg Ala Leu Ser Gin Leu Gly Ser Ala Val Glu Val Asn Glu Asp He 
20 25 30 

50 cca ccc cgt egg tac ttc cgc tct gga gtt gag att ate cga atg gca 203 
Pro Pro Arg Arg Tyr Phe Arg Ser Gly Val Glu He He Arg Met Ala 
35 40 45 



tec att tac tct gag gaa ggc aac att gaa cat gec ttc ate etc tat 251 
55 Ser He Tyr Ser Glu Glu Gly Asn He Glu His Ala Phe He Leu Tyr 
50 55 60 

aac aag tat ate acg etc ttt att gag aaa eta cca aaa cat cga gat 299 
Asn Lys Tyr He Thr Leu Phe He Glu Lys Leu Pro Lys His Arg Asp 
60 65 70 75 
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tac aaa tct get gtc att cct gaa aag aaa gac aca gta aag aaa tta 347 
Tyr Lys Ser Ala Val He Pro Glu Lys Lys Asp Thr Val Lys Lys Leu 
80 85 90 95 

5 

aag gag att gca ttt ccc aaa gca gaa gag ctg aag gca gag ctg tta 395 
Lys Glu He Ala Phe Pro Lys Ala Glu Glu Leu Lys Ala Glu Leu Leu 
100 105 110 

10 aaa cga tat acc aaa gaa tat aca gaa tat aat gaa gaa aag aag aag 4 43 

Lys Arg Tyr Thr Lys Glu Tyr Thr Glu Tyr Asn Glu Glu Lys Lys Lys 
115 120 125 

gaa qna gag gaa ttg gee egg aac atg gee ate c *g caa y&g ctg g^a 491 
15 ^1 ■ - 21.7 \.<ru Ala Avg Het .-•>>- r v.. C-?.:i n Clu 1^ ,;. 

130 * 135 i*0 

aag gaa aaa cag agg gta gca caa cag aag cag cag caa ttg gaa cag 539 
Lys Glu Lys Gin Arg Val Ala Gin Gin Lys Gin Gin Gin Leu Glu Gin 
20 145 150 155 

gaa cag ttc cat gec ttc gag gag atg ate egg aac cag gag eta gaa 587 

Glu Gin Phe His Ala Phe Glu Glu Met He Arg Asn Gin Glu Leu Glu 

160 165 170 175 

25 

aaa gag cga ctg aaa att gta cag gag ttt ggg aag gta gac cct ggc 635 

Lys Glu Arg Leu Lys He Val Gin Glu Phe Gly Lys Val Asp Pro Gly 
180 185 190 

30 eta ggt ggc ccg eta gtg cct gac ttg gag aag ccc tec tta gat gtg 683 
Leu Gly Gly Pro Leu Val Pro Asp Leu Glu Lys Pro Ser Leu Asp Val 
195 200 205 

ttc ccc acc tta aca gtc tea tec ata cag cct tea gac tgt cac aca 731 
35 Phe Pro Thr Leu Thr Val Ser Ser He Gin Pro Ser Asp Cys His Thr 
210 215 220 

act gta agg cca get aag cca cct gtg gtg gac agg tec ttg aaa cct 779 
Thr Val Arg Pro Ala Lys Pro Pro Val Val Asp Arg Ser Leu Lys Pro 
40 225 230 235 

gga gca ctg age aac tea gaa agt att ccc aca ate gat gga ttg cgc 827 
Gly Ala Leu Ser Asn Ser Glu Ser He Pro Thr He Asp Gly Leu Arg 
240 245 250 255 

45 

cat gtg gtg gtg cct ggg egg ctg tgc cca cag ttt etc cag tta gee 875 
His Val Val Val Pro Gly Arg Leu Cys Pro Gin Phe Leu Gin Leu Ala 
260 265 270 

50 agt gec aac act gec egg gga gtg gag aca tgt gga att etc tgt gga 923 
Ser Ala Asn Thr Ala Arg Gly Val Glu Thr Cys Gly He Leu Cys Gly 
275 280 285 

aaa ctg atg agg aat gaa ttt acc att acc cat gtt etc ate ccc aag 971 
55 Lys Leu Met Arg Asn Glu Phe Thr He Thr His Val Leu He Pro Lys 
290 295 300 

caa agt get ggg tct gat tac tgc aac aca gag aac gaa gaa gaa ctt 1019 
Gin Ser Ala Gly Ser Asp Tyr Cys Asn Thr Glu Asn Glu Glu Glu Leu 
60 305 310 315 
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ttc etc ata cag gat cag cag ggc etc ate aca ctg ggc tgg att cat 1067 
Phe Leu He Gin Asp Gin Gin Gly Leu He Thr Leu Gly Trp He His 
320 325 330 335 

5 

act cac ccc aca cag acc gcg ttt etc tec agt gtc gac eta cac act 1115 
Thr His Pro Thr Gin Thr Ala Phe Leu Ser Ser Val Asp Leu His Thr 
340 345 350 

10 cac tgc tct tac cag atg atg ttg cca gag tea gta gee att gtt tgc 1163 
His Cys Ser Tyr Gin Met Met Leu Pro Glu Ser Val Ala He Val Cys 
355 360 365 

tec ccc oag L;.c cag gaa act gga ttc ttt aaa eta act 'tac cat gga 1^11 
IS Sot P: ^ 11^ Thr Gly I>he~ l%e '-T.cu TUx .^sp' Ills GI- 

370 375 3b0 

eta gag gag att tct tec tgt cgc cag aaa gga ttt cat cca cac age 1259 
Leu Glu Glu He Ser Ser Cys Arg Gin Lys Gly Phe His Pro His Ser 
20 385 390 395 

aag gat cca cct ctg ttc tgt age tgc age cac gtg act gtt gtg gac 1307 
Lys Asp Pro Pro Leu Phe Cys Ser Cys Ser His Val Thr Val Val Asp 
400 405 410 415 

25 

aga gca gtg acc ate aca gac ctt cga tgagcgtttg agtccaacac 1354 
Arg Ala Val Thr He Thr Asp Leu Arg 
420 

30 
35 
40 
45 
50 

<210> 2 
55 <211> 424 
<212> PRT 



cttccaagaa 


caacaaaacc 


atatcagtgt 


actgtagccc 


cttaatttaa 


gctttctaga 


1414 


aagctttgga 


agtttttgta 


gatagtagaa 


aggggggcat 


cacctgagaa 


agagctgatt 


1474 


ttgtatttca 


ggtttgaaaa gaaataactg 


aacatatttt 


ttaggcaagt 


cagaaagaga 


1534 


acatggtcac 


ccaaaagcaa 


ctgtaactca 


gaaattaagt 


tactcagaaa 


ttaagtagct 


1594 


cagaaattaa 


gaaagaatgg tataatgaac 


ccccatatac 


ccttccttct 


ggattcacca 


1654 


attgttaaca 


tttttttcct 


ctcagctatc 


cttctaattt 


ctctctaatt 


tcaatttgtt 


1714 


tatatttacc 


tctgggctca 


ataagggcat 


ctgtgcagaa 


atttggaagc 


catttagaaa 


1774 


atcttttgga 


ttttcctgtg 


gtttatggca 


atatgaatgg 


agcttattac 


tggggtgagg 


1834 


gacagcttac 


tccatttgac 


cagattgttt 


ggctaacaca 


tcccgaagaa 


tgattttgtc 


1894 


aggaattatt 


gttatttaat 


aaatatttca 


ggatattttt 


cctctacaat 


aaagtaacaa 


1954 


ttaacttaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaactcgag 




2003 



<213> Homo sapiens 

60 



WO 01/21794 



PCT/USOO/25790 



<400> 2 

Met Ser Asp His Gly Asp Val Ser Leu Pro Pro Glu Asp Arg Val Arg 
5 1 5 10 15 



Ala Leu Ser Gin Leu Gly Ser Ala Val Glu Val Asn Glu Asp lie Pro 
20 25 30 

10 

Pro Arg Arg Tyr Phe Arg Ser Gly Val Glu lie lie Arg Met Ala Ser 
35 40 45 

lie Tyx oor Glu Glu Gly Asn lie Glu Ala Phe lie Leu Tyr A an 

50 55 60 



20 Lys Tyr lie Thr Leu Phe lie Glu Lys Leu Pro Lys His Arg Asp Tyr 
65 70 75 80 



Lys Ser Ala Val He Pro Glu Lys Lys Asp Thr Val Lys Lys Leu Lys 
25 85 90 95 



Glu He Ala Phe Pro Lys Ala Glu Glu Leu Lys Ala Glu Leu Leu Lys 
100 105 110 

30 

Arg Tyr Thr Lys Glu Tyr Thr Glu Tyr Asn Glu Glu Lys Lys Lys Glu 
115 120 125 

35 

Ala Glu Glu Leu Ala Arg Asn Met Ala He Gin Gin Glu Leu Glu Lys 
130 135 140 



40 Glu Lys Gin Arg Val Ala Gin Gin Lys Gin Gin Gin Leu Glu Gin Glu 
145 150 155 160 



Gin Phe His Ala Phe Glu Glu Met He Arg Asn Gin Glu Leu Glu Lys 
45 165 170 175 



Glu Arg Leu Lys He Val Gin Glu Phe Gly Lys Val Asp Pro Gly Leu 
180 185 190 

50 

Gly Gly Pro Leu Val Pro Asp Leu Glu Lys Pro Ser Leu Asp Val Phe 
195 200 205 

55 

Pro Thr Leu Thr Val Ser Ser He Gin Pro Ser Asp Cys His Thr Thr 
210 215 220 



60 Val Arg Pro Ala Lys Pro Pro Val Val Asp Arg Ser Leu Lys Pro Gly 
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225 230 235 240 

Ala Leu Ser Asn Ser Glu Ser lie Pro Thr He Asp Gly Leu Arg His 
5 245 250 255 



Val Val Val Pro Gly Arg Leu Cys Pro Gin Phe Leu Gin Leu Ala Ser 
260 265 270 

10 

Ala Asn Thr Ala Arg Gly Val Glu Thr Cys Gly He Leu Cys Gly Lys 
275 280 

15 ■ ' . ' ' ' 

Leu Met Arg Asn Glu Phe ?hr He Thr His Val Leu 11- »ro Lys Gin 
290 295 300 



20 Ser Ala Gly Ser Asp Tyr Cys Asn Thr Glu Asn Glu Glu Glu Leu Phe 
305 310 315 320 



Leu He Gin Asp Gin Gin Gly Leu He Thr Leu Gly Trp He His Thr 
25 325 330 335 



His Pro Thr Gin Thr Ala Phe Leu Ser Ser Val Asp Leu His Thr His 
340 345 350 

30 

Cys Ser Tyr Gin Met Met Leu Pro Glu Ser Val Ala lie Val Cys Ser 
355 360 365 

35 

Pro Lys Phe Gin Glu Thr Gly Phe Phe Lys Leu Thr Asp His Gly Leu 
370 375 380 



40 Glu Glu He Ser Ser Cys Arg Gin Lys Gly Phe His Pro His Ser Lys 
385 390 395 400 



Asp Pro Pro Leu Phe Cys Ser Cys Ser His Val Thr Val Val Asp Arg 
45 405 410 415 



Ala Val Thr He Thr Asp Leu Arg 
420 

50 

<210> 3 
<211> 2434 

55 

<212> DNA 



<213> Homo sapiens 

60 
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<220> 

<221> CDS 
5 <222> (270) . .(2036) 



10 



20 



<400> 3 

aagcggtggc tgctgcggat gtcggtgtga gcgagcggcg cctgaacaca cggcggctgc 60 

cgagcgcctg acccgggcct gcgccagagc ctgcaccgag ctccggggcc ccacacccgc 120 

taccgtggoo ctgrgcrcgt tgctaotgag gcggcgtgct c*:q--; cctt czctot ecen IPO 

ga^ucaccgc cgccgccgcc gccacagcc atg gcc gag agt ggt gaa age ggc 293 

Met Ala Glu Ser Gly Glu Ser Gly 
1 5 

ggt cct ccg ggc tec cag gat age gcc gcc gga gcc gaa ggt get ggc 341 
Gly Pro Pro Gly Ser Gin Asp Ser Ala Ala Gly Ala Glu Gly Ala Gly 
10 15 20 

25 gcc ccc gcg gcc get gcc tec gcg gag ccc aaa ate atg aaa gtc acc 389 
Ala Pro Ala Ala Ala Ala Ser Ala Glu Pro Lys lie Met Lys Val Thr 
25 30 35 40 

gtg aag acc ccg aag gaa aag gag gaa ttc gcc gtg ccc gag aat age 437 
30 Val Lys Thr Pro Lys Glu Lys Glu Glu Phe Ala Val Pro Glu Asn Ser 

45 50 " 55 

tec gtc cag cag ttt aag gaa gaa ate tct aaa cgt ttt aaa tea cat 485 
Ser Val Gin Gin Phe Lys Glu Glu lie Ser Lys Arg Phe Lys Ser His 
35 60 65 70 

act gac caa ctt gtg ttg ata ttt get gga aaa att ttg aaa gat caa 533 
Thr Asp Gin Leu Val Leu He Phe Ala Gly Lys He Leu Lys Asp Gin 
75 80 85 

40 

gat acc ttg agt cag cat gga att cat gat gga ctt act gtt cac ctt 581 
Asp Thr Leu Ser Gin His Gly He His Asp Gly Leu Thr Val His Leu 
90 95 100 

45 gtc att aaa aca caa aac agg cct cag gat cat tea get cag caa aca 629 
Val He Lys Thr Gin Asn Arg Pro Gin Asp His Ser Ala Gin Gin Thr 
105 110 115 120 

aat aca get gga age aat gtt act aca tea tea act cct aat agt aac 677 
50 Asn Thr Ala Gly Ser Asn Val Thr Thr Ser Ser Thr Pro Asn Ser Asn 

125 130 135 

tct aca tct ggt tct get act age aac cct ttt ggt tta ggt ggc ctt 725 
Ser Thr Ser Gly Ser Ala Thr Ser Asn Pro Phe Gly Leu Gly Gly Leu 
55 140 145 150 

ggg gga ctt gca ggt ctg agt age ttg ggt ttg aat act acc aac ttc 773 

Gly Gly Leu Ala Gly Leu Ser Ser Leu Gly Leu Asn Thr Thr Asn Phe 

155 160 165 

60 
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tct gaa eta cag agt cag atg cag cga caa ctt ttg tct aac cct gaa 821 
Ser Glu Leu Gin Ser Gin Met Gin Arg Gin Leu Leu Ser Asn Pro Glu 
170 175 180 

5 atg atg gtc cag ate atg gaa aat ccc ttt gtt cag age atg etc tea 869 
Met Met Val Gin lie Met Glu Asn Pro Phe Val Gin Ser Met Leu Ser 
185 190 195 200 

aat cct gac ctg atg aga cag tta att atg gec aat cca caa atg cag 917 
10 Asn Pro Asp Leu Met Arg Gin Leu lie Met Ala Asn Pro Gin Met Gin 

205 210 215 

r.^r* )\tq at a cag aga aat cca gaa att agt cat atg tta ent nst ..vet 965 
win Leu- lie Gin Arg Asn. Pro Glu lie 3a His net Leu Asn h^tt Pro 1 
■ ^ . ;220 • -25 * :>30 

gat ata atg aga caa acg ttg gaa ctt gec agg aat cca gca atg atg Ioj.3 

Asp lie Met Arg Gin Thr Leu Glu Leu Ala Arg Asn Pro Ala Met Met 

235 240 245 

20 

cag gag atg atg agg aac cag gac cga get ttg age aac eta gaa age 1061 

Gin Glu Met Met Arg Asn Gin Asp Arg Ala Leu Ser Asn Leu Glu Ser 

250 255 260 



25 



ate cca ggg gga tat aat get tta agg cgc atg tac aca gat att cag 1109 
lie Pro Gly Gly Tyr Asn Ala Leu Arg Arg Met Tyr Thr Asp lie Gin 
265 270 275 280 



gaa cca atg ctg agt get gca caa gag cag ttt ggt ggt aat cca ttt 1157 
30 Glu Pro Met Leu Ser Ala Ala Gin Glu Gin Phe Gly Gly Asn Pro Phe 

285 290 295 

get tec ttg gtg age aat aca tec tct ggt gaa ggt agt caa cct tec 1205 
Ala Ser Leu Val Ser Asn Thr Ser Ser Gly Glu Gly Ser Gin Pro Ser 
35 300 305 310 

cgt aca gaa aat aga gat cca eta ccc aat cca tgg get cca cag act 1253 

Arg Thr Glu Asn Arg Asp Pro Leu Pro Asn Pro Trp Ala Pro Gin Thr 

315 320 325 

40 

tec cag agt tea tea get tec age ggc act gee age act gtg ggt ggc 1301 

Ser Gin Ser Ser Ser Ala Ser Ser Gly Thr Ala Ser Thr Val Gly Gly 

330 335 340 

45 act act ggt agt act gee agt ggc act tct ggg cag agt act act gcg 134 9 
Thr Thr Gly Ser Thr Ala Ser Gly Thr Ser Gly Gin Ser Thr Thr Ala 
345 350 355 360 

cca aat ttg gtg cct gga gta gga get agt atg ttc aac aca cca gga 1397 
50 Pro Asn Leu Val Pro Gly Val Gly Ala Ser Met Phe Asn Thr Pro Gly 

365 370 375 

atg cag age ttg ttg caa caa ata act gaa aac cca caa ctg atg caa 1445 
Met Gin Ser Leu Leu Gin Gin lie Thr Glu Asn Pro Gin Leu Met Gin 
55 380 385 390 

aac atg ttg tct gee ccc tac atg aga age atg atg cag tea eta age 14 93 

Asn Met Leu Ser Ala Pro Tyr Met Arg Ser Met Met Gin Ser Leu Ser 

395 400 405 

60 
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cag aat cct gac ctt get gca cag atg atg ctg aat aat ccc eta ttt 
Gin Asn Pro Asp Leu Ala Ala Gin Met Met Leu Asn Asn Pro Leu Phe 
410 415 420 



1541 



10 



get gga aat cct cag ctt caa gaa caa atg aga caa cag etc cca act 

Ala Gly Asn Pro Gin Leu Gin Glu Gin Met Arg Gin Gin Leu Pro Thr 

425 430 435 440 

ttc etc caa caa atg cag aat cct gat aca eta tea gca atg tea aac 

Phe Leu Gin Gin Met Gin Asn Pro Asp Thr Leu Ser Ala Met Ser Asn 

445 450 455 

cct aga crca atq rag.coc ttg tta cag att carr C a*j ggt tta cag aca 

Pro Arg Ala Mec Gin -Ala Leu Leu Gin lie Gin. Gin Giy Leu oln Thr 



1589 



1637 



20 



tta gca acg gaa gee ccg ggc etc ate cca ggg ttt act cct ggc ttg 1733 
Leu Ala Thr Glu Ala Pro Gly Leu He Pro Gly Phe Thr Pro Gly Leu 
475 480 485 

ggg gca tta gga age act gga ggc tct teg gga act aat gga tct aac 1781 
Gly Ala Leu Gly Ser Thr Gly Gly Ser Ser Gly Thr Asn Gly Ser Asn 
4 90 4 95 500 



25 



gee aca cct agt gaa aac aca agt ccc aca gca gga ace act gaa cct 
Ala Thr Pro Ser Glu Asn Thr Ser Pro Thr Ala Gly Thr Thr Glu Pro 
505 510 515 520 



1829 



30 



gga cat cag cag ttt att cag cag atg ctg cag get ctt get gga gta 
Gly His Gin Gin Phe He Gin Gin Met Leu Gin Ala Leu Ala Gly Val 
525 530 535 



1877 



35 



aat cct cag eta cag aat cca gaa gtc aga ttt cag caa caa ctg gaa 
Asn Pro Gin Leu Gin Asn Pro Glu Val Arg Phe Gin Gin Gin Leu Glu 
540 545 550 



1925 



40 



caa etc agt gca atg gga ttt ttg aac cgt gaa gca aac ttg caa get 1973 
Gin Leu Ser Ala Met Gly Phe Leu Asn Arg Glu Ala Asn Leu Gin Ala 
555 560 565 

eta ata gca aca gga ggt gat ate aat gca get att gaa agg tta ctg 2021 
Leu He Ala Thr Gly Gly Asp He Asn Ala Ala He Glu Arg Leu Leu 
570 575 580 



45 ggc tec cag cca tea tagcagcatt tctgtatctt gaaaaaatgt aatttatttt 
Gly Ser Gin Pro Ser 
585 



2076 



50 



55 



60 



tgataaegge 


tcttaaactt 


taaaatacct 


gctttatttc 


attttgactc 


ttggaattct 


2136 


gtgctgttat 


aaacaaaccc 


aatatgatgc 


attttaaggt 


ggagtacagt 


aagatgtgtg 


2196 


ggtttttctg 


tatttttctt 


ttctggaaca 


gtgggaatta 


aggctactgc 


atgeatcact 


2256 


tetgeattta 


ttgtaatttt 


ttaaaaacat 


caccttttat 


agttgggtga 


ccagattttg 


2316 


tcctgcatct 


gtccagttta 


tttgcttttt 


aaacattagc 


ctatggtagt 


aatttatgta 


2376 


gaataaaagc 


attaaaaaga 


agcaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aactcgag 


2434 
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<210> 4 

<211> 589 

5 

<212> PRT 

<213> Homo sapiens 

10 

<400> 4 

Mp 1 * Ala Glu Ser fily c.i.ii r-r' ^Xy Gly Pro Pro G) y Ssr Cln Asp Ser 



Ala Ala Gly Ala Glu Gly Ala Gly Ala Pro Ala Ala Ala Ala Ser Ala 
20 25 30 

20 

Glu Pro Lys lie Met Lys Val Thr Val Lys Thr Pro Lys Glu Lys Glu 
35 40 45 



25 Glu Phe Ala Val Pro Glu Asn Ser Ser Val Gin Gin Phe Lys Glu Glu 
50 55 60 



lie Ser Lys Arg Phe Lys Ser His Thr Asp Gin Leu Val Leu lie Phe 
30 65 70 75 80 



Ala Gly Lys lie Leu Lys Asp Gin Asp Thr Leu Ser Gin His Gly lie 
85 90 95 

35 

His Asp Gly Leu Thr Val His Leu Val lie Lys Thr Gin Asn Arg Pro 
100 105 110 

40 

Gin Asp His Ser Ala Gin Gin Thr Asn Thr Ala Gly Ser Asn Val Thr 
115 120 125 



45 Thr Ser Ser Thr Pro Asn Ser Asn Ser Thr Ser Gly Ser Ala Thr Ser 
130 135 140 



Asn Pro Phe Gly Leu Gly Gly Leu Gly Gly Leu Ala Gly Leu Ser Ser 
50 145 150 155 160 



Leu Gly Leu Asn Thr Thr Asn Phe Ser Glu Leu Gin Ser Gin Met Gin 
165 170 175 

55 

Arg Gin Leu Leu Ser Asn Pro Glu Met Met Val Gin lie Met Glu Asn 
180 185 190 



60 
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Pro Phe Val Gin Ser Met Leu Ser Asn Pro Asp Leu Met Arg Gin Leu 
195 200 205 



5 lie Met Ala Asn Pro Gin Met Gin Gin Leu He Gin Arg Asn Pro Glu 
210 215 220 



lie Ser His Met Leu Asn Asn Pro Asp He Met Arg Gin Thr Leu Glu 
10 225 230 235 240 



Lf»u 7U:-'Arg Asn Pro Ala Met Met. 01 r> Olis- Met m<?M Arc A&v G^n A«r- 

-■/:.. i^ 1 ;. ^5 250 - ■ '-255' 



Arg Ala Leu Ser Asn Leu Glu Ser He Pro Gly Gly I'yr Asn Ala Leu 
260 265 270 



20 



Arg Arg Met Tyr Thr Asp He Gin Glu Pro Met Leu Ser Ala Ala Gin 
275 280 285 



25 Glu Gin Phe Gly Gly Asn Pro Phe Ala Ser Leu Val Ser Asn Thr Ser 
290 295 300 



Ser Gly Glu Gly Ser Gin Pro Ser Arg Thr Glu Asn Arg Asp Pro Leu 
30 305 310 315 320 



35 



Pro Asn Pro Trp Ala Pro Gin Thr Ser Gin Ser Ser Ser Ala Ser Ser 
325 330 335 



40 



Gly Thr Ala Ser Thr Val Gly Gly Thr Thr Gly Ser Thr Ala Ser Gly 
340 345 350 



Thr Ser Gly Gin Ser Thr Thr Ala Pro Asn Leu Val Pro Gly Val Gly 
355 360 365 



45 Ala Ser Met Phe Asn Thr Pro Gly Met Gin Ser Leu Leu Gin Gin He 
370 375 380 



Thr Glu Asn Pro Gin Leu Met Gin Asn Met Leu Ser Ala Pro Tyr Met 
50 385 390 395 400 



55 



Arg Ser Met Met Gin Ser Leu Ser Gin Asn Pro Asp Leu Ala Ala Gin 
405 410 415 



Met Met Leu Asn Asn Pro Leu Phe Ala Gly Asn Pro Gin Leu Gin Glu 
420 425 430 



60 
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Gin Met Arg Gin Gin Leu Pro Thr Phe Leu Gin Gin Met Gin Asn Pro 
435 440 445 

5 Asp Thr Leu Ser Ala Met Ser Asn Pro Arg Ala Met Gin Ala Leu Leu 
450 455 460 

Gin He Gin Gin Gly Leu Gin Thr Leu Ala Thr Glu Ala Pro Gly Leu 
10 465 470 475 480 

Jle Pro Gly Phe Tf>r Pro Gly Leu Gly Ala Leu. Gly S^r Gly 

^^:> : ■ 485 4 90 ' 495 

15 .■" • ' ■' 

Ser Ser Gly Thr Asn Gly Ser Asn Ala Thr Pro Ser Glu Asn Thr Ser 
500 505 510 

20 

Pro Thr Ala Gly Thr Thr Glu Pro Gly His Gin Gin Phe He Gin Gin 
515 520 525 

25 Met Leu Gin Ala Leu Ala Gly Val Asn Pro Gin Leu Gin Asn Pro Glu 
530 535 540 

Val Arg Phe Gin Gin Gin Leu Glu Gin Leu Ser Ala Met Gly Phe Leu 
30 545 550 555 560 

Asn Arg Glu Ala Asn Leu Gin Ala Leu He Ala Thr Gly Gly Asp He 
565 570 575 

35 

Asn Ala Ala He Glu Arg Leu Leu Gly Ser Gin Pro Ser 
580 585 

40 

<210> 5 
<211> 882 
45 <212> DNA 

<213> Homo sapiens 

50 <220> 

<221> CDS 

<222> {!).. (630) 

55 

<400> 5 

gaa ttc ggc acg agg cgc ggt ccc ccc etc tec etc cgc ttc gca etc 48 
Glu Phe Gly Thr Arg Arg Gly Pro Pro Leu Ser Leu Arg Phe Ala Leu 
60 1 5 10 15 
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20 



ccg teg ggt acg gga agg tec aag ccg ctg ccg ggt gec cga ggg ccg 96 
Pro Ser Gly Thr Gly Arg Ser Lys Pro Leu Pro Gly Ala Arg Gly Pro 
20 25 30 

teg tgg ccg ccg teg cca egg gtc cca atg gag ccg ccg aat etc tat 144 
Ser Trp Pro Pro Ser Pro Arg Val Pro Met Glu Pro Pro Asn Leu Tyr 
35 40 45 

ccg gtg aag etc tac gtg tac gac ctg tec aaa ggc ctg gec egg egg 192 
Pro Val Lys Leu Tyr Val Tyr Asp Leu Ser Lys Gly Leu Ala Arg Arg 
50 55 60 

etc age ecu ate atg ctg ggg aaa ua ctg gaa ggc ate tag eac aqa 240 

^ L*- - Sxr Pro Met 5,eu y Lys >V :\ Leu Glu ? . \ :» r> T;u ff" ^Or. " 

o5 70 75 U C 

tec ata gtt gtg cac aag gat gag ttc ttc ttc ggc agt ggt ggt ate 288 
Ser lie Val Val His Lys Asp Glu Phe Phe Phe Gly Ser Gly Gly lie 
20 85 90 95 

tec age tgc ccc ccg gga ggg aca ttg ctt ggg cct cca gac tct gtg 336 

Ser Ser Cys Pro Pro Gly Gly Thr Leu Leu Gly Pro Pro Asp Ser Val 
100 105 110 

25 

gtt gat gtg ggg agt aca gaa gtc aca gaa gaa ate ttc ttc tgg agt 384 

Val Asp Val Gly Ser Thr Glu Val Thr Glu Glu lie Phe Phe Trp Ser 
115 120 125 

30 ace tct cct ccc tgg ggg agt ccc tgt ttc cga ggt gag gec tac aac 432 
Thr Ser Pro Pro Trp Gly Ser Pro Cys Phe Arg Gly Glu Ala Tyr Asn 
130 135 140 

etc ttt gaa cac aat tgt aac acc ttc age aac gaa gtg gca cag ttc 480 
35 Leu Phe Glu His Asn Cys Asn Thr Phe Ser Asn Glu Val Ala Gin Phe 
145 150 155 160 

ctg act ggg egg aag att cct tct tac ate aca gac ctg ccc tct gaa 528 
Leu Thr Gly Arg Lys lie Pro Ser Tyr He Thr Asp Leu Pro Ser Glu 
40 165 170 175 

gtt etc tec acg ccc ttt gga cag gca ctt egg ccc etc ctg gac tec 576 

Val Leu Ser Thr Pro Phe Gly Gin Ala Leu Arg Pro Leu Leu Asp Ser 
180 185 190 

45 

att cag ate cag cct cca gga ggg age tec gtg ggc aga ccc aac ggc 624 

lie Gin He Gin Pro Pro Gly Gly Ser Ser Val Gly Arg Pro Asn Gly 
195 200 205 

50 cag age taacaggact gcctgggacc gccctgcctc accagggctt ttccttttta 680 
Gin Ser 
210 



55 



aacaaaacaa accctaccag atttctattt tataatttta catcagagct aacaaccagg 74 0 

ggacggcttt ttaaatttcc cagggaagga gaeegtcagg ccgcatgtag acaatgetge 800 

taagaaacag aacaaaatgc caccccttct aatagtatta tactaattta ttaagaaaaa 8 60 

60 aaaaaaaaaa aaaaaactcg ag 882 
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<210> 6 
5 <211> 210 
<212> PRT 
<213> Homo sapiens 

10 

<400> 6 

Glu I*he Gly Thr Arg Ax^ Gly Pro Pro Leu- Ser L-u Arg J?he Ala j>u 
'15" 1 ■ ^ 'j " ' • 10 . 1 15 



Pro Ser Gly Thr Gly Arg Ser Lys Pro Leu Pro Gly Ala Arg Gly Pro 
20 25 30 

20 

Ser Trp Pro Pro Ser Pro Arg Val Pro Met Glu Pro Pro Asn Leu Tyr 
35 40 45 

25 

Pro Val Lys Leu Tyr Val Tyr Asp Leu Ser Lys Gly Leu Ala Arg Arg 
50 55 60 



30 Leu Ser Pro lie Met Leu Gly Lys Gin Leu Glu Gly lie Trp His Thr 
65 70 75 80 



Ser lie Val Val His Lys Asp Glu Phe Phe Phe Gly Ser Gly Gly He 
35 85 90 95 



Ser Ser Cys Pro Pro Gly Gly Thr Leu Leu Gly Pro Pro Asp Ser Val 
100 105 110 

40 

Val Asp Val Gly Ser Thr Glu Val Thr Glu Glu He Phe Phe Trp Ser 
115 120 125 

45 

Thr Ser Pro Pro Trp Gly Ser Pro Cys Phe Arg Gly Glu Ala Tyr Asn 
130 135 140 



50 Leu Phe Glu His Asn Cys Asn Thr Phe Ser Asn Glu Val Ala Gin Phe 
145 150 155 160 



Leu Thr Gly Arg Lys He Pro Ser Tyr He Thr Asp Leu Pro Ser Glu 
55 165 170 175 



Val Leu Ser Thr Pro Phe Gly Gin Ala Leu Arg Pro Leu Leu Asp Ser 
180 185 190 

60 
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Ile Gin lie Gin Pro Pro Gly Gly Ser Ser Val Gly Arg Pro Asn Gly 
195 200 205 



Gin Ser 
210 



10 <210> 7 

<211> 1318 

... .<2i.;> DHA 

<2i3> Homo sapiens 



20 



25 



30 



<220> 

<221> CDS 

<222> (177) . . (1190) 

<400> 7 

cgggaaggat tgaatacgag acgctgtctg cttgctgcct 
ctgattaact tttaaaatac ccagcttggt ttatttttct 
ggggacgctg ttttctttta caaagggaaa tctaagttaa 



taagacagct agctgaattg 60 

tagaatctgt tgctaagact 120 

tttcaaggca ttcgaa atg 17 9 
Met 
1 



35 ggg aaa gac tat tat tgc att ttg gga att gag 

Gly Lys Asp Tyr Tyr Cys lie Leu Gly lie Glu 
5 10 

gaa gat att aaa aag get tac cga aaa caa gec 

40 Glu Asp lie Lys Lys Ala Tyr Arg Lys Gin Ala 
20 25 



45 



50 



aaa gga get tea gat 
Lys Gly Ala Ser Asp 
15 



227 



gac aag aac aaa tct cct cag gca gag gaa aaa 

Asp Lys Asn Lys Ser Pro Gin Ala Glu Glu Lys 

35 40 

gaa get tat gaa gta ttg agt gat cct aaa aag 

Glu Ala Tyr Glu Val Leu Ser Asp Pro Lys Lys 
50 55 60 

cag ttt ggg gag gaa ggg ttg aaa gga gga gca 

Gin Phe Gly Glu Glu Gly Leu Lys Gly Gly Ala 
70 75 



55 caa gga ggt acc ttc egg tac ace ttt cat ggc 
Gin Gly Gly Thr Phe Arg Tyr Thr Phe His Gly 
85 90 

ttt get gca ttt ttc gga ggg tec aac ccc ttt 
60 Phe Ala Ala Phe Phe Gly Gly Ser Asn Pro Phe 



etc aaa ttt cat ccg 275 
Leu Lys Phe His Pro 
30 

ttt aaa gag gtc gca 323 

Phe Lys Glu Val Ala 

45 

aga gaa ata tat gat 371 
Arg Glu He Tyr Asp 
65 

gga ggt act gat gga 419 
Gly Gly Thr Asp Gly 
80 

gat cct cat get aca 4 67 

Asp Pro His Ala Thr 
95 

gaa att ttc ttt gga 515 
Glu He Phe Phe Gly 
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100 105 110 

aga cga atg ggt ggt ggt aga gat tct gaa gaa atg gaa at a gat ggt 563 
Arg Arg Met Gly Gly Gly Arg Asp Ser Glu Glu Met Glu He Asp Gly 
5 115 120 125 

gat cct ttt agt gcc ttt ggt ttc age atg aat gga tat cca aga gac 611 
Asp Pro Phe Ser Ala Phe Gly Phe Ser Met Asn Gly Tyr Pro Arg Asp 
130 135 140 145 



10 



agg aat tct gtg ggg cca tec cgc etc aaa caa gat cct cca gtt att 
Arg Asn Ser Val Gly Pro Ser Arg Leu Lys Gin Asp Pro Pro Val He 

150 155 . 1^0 



aga tct gag gac aaa att ctt acc att gag att aaa aaa ggg tgg aaa 
Arg Ser Glu Asp Lys He Leu Thr He Glu He Lys Lys Gly Trp Lys 
25 195 200 205 

gaa ggc acc aaa att act ttt cca aga gaa gga gat gaa aca cca aat 
Glu Gly Thr Lys He Thr Phe Pro Arg Glu Gly Asp Glu Thr Pro Asn 
210 215 220 225 

30 

agt att cca gca gac att gtt ttt ate att aaa gac aaa gat cat cca 
Ser He Pro Ala Asp He Val Phe He He Lys Asp Lys Asp His Pro 
230 235 240 

35 aaa ttt aaa agg gat gga tea aat ata att tat act get aaa att agt 
Lys Phe Lys Arg Asp Gly Ser Asn He He Tyr Thr Ala Lys He Ser 
245 250 255 

tta cga gag gca ttg tgt ggc tgc tea att aat gta cca aca ctg gat 
40 Leu Arg Glu Ala Leu Cys Gly Cys Ser He Asn Val Pro Thr Leu Asp 
260 265 270 

gga aga aac ata cct atg tea gta aat gat att gtg aaa ccc gga atg 
Gly Arg Asn He Pro Met Ser Val Asn Asp He Val Lys Pro Gly Met 
45 275 280 285 

agg aga aga att att gga tat ggg ctg cca ttt cca aaa aat cct gac 

Arq Arg Arg He He Gly Tyr Gly Leu Pro Phe Pro Lys Asn Pro Asp 

290 295 300 305 

50 

caa cgt ggt gac ctt eta ata gaa ttt gag gtg tec ttc cca gat act 

Gin Arq Gly Asp Leu Leu He Glu Phe Glu Val Ser Phe Pro Asp Thr 

310 315 320 

55 ata tct tct tea tec aaa gaa gta ctt agg aaa cat ctt cct gcc tea 
He Ser Ser Ser Ser Lys Glu Val Leu Arg Lys His Leu Pro Ala Ser 
325 330 335 



659 



\% re** aga qtc. U ^'.t -r^ tat ;-cT 'J-J" -'9 1 " — /u ' 

His Glu Leu Arg Val Ser Leu Glu Glu II? ryr Ser Gly Cys Thr Lys 
165 170 175 

egg atg aag att tct cga aaa agg eta aac get gat gga agg agt tac 755 

20 Arg Met Lys He Ser Arg Lys Arg Leu Asn Ala Asp Gly Arg Ser Tyr 

180 185 190 



803 



851 



899 



947 



995 



1043 



1091 



1139 



1187 



tag aatgaagaac tttgttacac atattttgat aaggcactga aaatataaaa 1240 



60 
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ggactggtag tttactgatg tagatgtgaa ttctgtataa agatgtgtaa attgttttga 1300 
gggttcatta aattgcat 1318 

5 

<210> 8 
<211> 337 
10 <212> PRT 

<213> Homo sapiens 

Met Gly Lys Asp Tyr Tyr Cys lie Leu Gly He Glu Lys Gly Ala Ser 
15 10 15 

20 

Asp Glu Asp He Lys Lys Ala Tyr Arg Lys Gin Ala Leu Lys Phe His 
20 25 30 



25 Pro Asp Lys Asn Lys Ser Pro Gin Ala Glu Glu Lys Phe Lys Glu Val 
35 40 45 



Ala Glu Ala Tyr Glu Val Leu Ser Asp Pro Lys Lys Arg Glu He Tyr 
30 50 55 60 



Asp Gin Phe Gly Glu Glu Gly Leu Lys Gly Gly Ala Gly Gly Thr Asp 
65 70 75 80 

35 

Gly Gin Gly Gly Thr Phe Arg Tyr Thr Phe His Gly Asp Pro His Ala 
85 90 95 

40 

Thr Phe Ala Ala Phe Phe Gly Gly Ser Asn Pro Phe Glu He Phe Phe 
100 105 110 



45 Gly Arg Arg Met Gly Gly Gly Arg Asp Ser Glu Glu Met Glu He Asp 
115 120 125 



Gly Asp Pro Phe Ser Ala Phe Gly Phe Ser Met Asn Gly Tyr Pro Arg 
50 130 135 140 



Asp Arg Asn Ser Val Gly Pro Ser Arg Leu Lys Gin Asp Pro Pro Val 
145 150 155 160 



55 



He His Glu Leu Arg Val Ser Leu Glu Glu He Tyr Ser Gly Cys Thr 
165 170 175 



60 
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Lys Arg Met Lys lie Ser Arg Lys Arg Leu Asn Ala Asp Gly Arg Ser 
180 185 190 



5 Tyr Arg Ser Glu Asp Lys lie Leu Thr He Glu lie Lys Lys Gly Trp 
195 200 205 



Lys Glu Gly Thr Lys lie Thr Phe Pro Arg Glu Gly Asp Glu Thr Pro 
10 210 215 220 



As r - Ser Tie Pro Ala Asp He Val Pha He He l*y$ Asp T .vr His 
225 230 235 " 240 

Pro Lys Phe Lys Arg Asp Gly Ser Asn He He Tyr Thr Ala Lys He 
245 250 255 

20 

Ser Leu Arg Glu Ala Leu Cys Gly Cys Ser He Asn Val Pro Thr Leu 
260 265 270 



25 Asp Gly Arg Asn He Pro Met Ser Val Asn Asp He Val Lys Pro Gly 
275 280 285 



Met Arg Arg Arg He He Gly Tyr Gly Leu Pro Phe Pro Lys Asn Pro 
30 290 295 300 



Asp Gin Arg Gly Asp Leu Leu He Glu Phe Glu Val Ser Phe Pro Asp 
305 310 315 320 

35 

Thr He Ser Ser Ser Ser Lys Glu Val Leu Arg Lys His Leu Pro Ala 
325 330 335 

40 

Ser 



45 <210> 9 

<211> 532 

<212> DNA 

50 

<213> Homo sapiens 



<220> 

55 

<221> CDS 

<222> (26).. (496) 
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18 



<400> 9 

gccatctgcg gtggagccgc acaaa atg cag att ttc gtg aaa acc ctt acg 52 

Met Gin lie Phe Val Lys Thr Leu Thr 
1 5 

ggg aag acc ate acc etc gag gtt gaa ccc teg gat acg ata gaa aat 100 

Gly Lys Thr He Thr Leu Glu Val Glu Pro Ser Asp Thr He Glu Asn 

10 15 20 25 

gta aag gee aag ate cag gat aag gaa gga att cct cct gat cag cag 148 
Val Lys Ala Lys He Gin Asp Lys Glu Gly He Pro Pro Asp Gin Gin 

3Q 35 *0 

• • d;q-.iir -r.i. vTt QQv aag nag ntq g^l -qg® cgt cct ■ ■ Z ' - 

Arg Leu He Phe A?* Gly Lys Gin Leu Glu Asp Gly Arg Thr Leu Ser 
45 50 55 

gac tac aat att caa aag gag tct act ctt cat ctt gtg ttg aga ctt 24 4 

20 Asp Tyr Asn He Gin Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu 
60 65 70 

cgt ggt ggt get aag aaa agg aag aag aag tct tac acc act ccc aag 292 
Arg Gly Gly Ala Lys Lys Arg Lys Lys Lys Ser Tyr Thr Thr Pro Lys 
25 75 80 85 

aag aat aag cac aag aga aag aag gtt aag ctg get gtc ctg aaa tat 340 

Lys Asn Lys His Lys Arg Lys Lys Val Lys Leu Ala Val Leu Lys Tyr 

90 95 100 105 

30 

tat aag gtg gat gag aat ggc aaa att agt cgc ctt cgt cga gag tgc 388 

Tyr Lys Val Asp Glu Asn Gly Lys He Ser Arg Leu Arg Arg Glu Cys 

110 115 120 

35 cct tct gat gaa tgt ggt get ggg gtg ttt atg gca agt cac ttt gac 436 
Pro Ser Asp Glu Cys Gly Ala Gly Val Phe Met Ala Ser His Phe Asp 
125 130 135 

aga cat tat tgt ggc aaa tgt tgt ctg act tac tgt ttc aac aaa cca 484 
40 Arg His Tyr Cys Gly Lys Cys Cys Leu Thr Tyr Cys Phe Asn Lys Pro 
140 145 150 

gaa gac aag taa ctgtatgagt taataaaaga catgaactaa caaaaa 532 
Glu Asp Lys 
45 155 



<210> 10 

50 <211> 156 

<212> PRT 

<213> Homo sapiens 



55 



<400> 10 



Met Gin He Phe Val Lys Thr Leu Thr Gly Lys Thr He Thr Leu Glu 
60 1 5 10 15 
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Val Glu Pro Ser Asp Thr He Glu Asn Val Lys Ala Lys He Gin Asp 
20 25 30 

5 

Lys Glu Gly He Pro Pro Asp Gin Gin Arg Leu He Phe Ala Gly Lys 
35 40 45 

10 

Gin Leu Glu Asp Gly Arg Thr Leu Ser Asp Tyr Asn He Gin Lys Glu 
50 55 60 



V*u. \t>i- ">ra T^u t\r~ GIv 'V - ?. " r :, Y ~; i:y - Aiiy 
70 " 75" " 80 



Lys Lys Lys Ser Tyr Thr Thr Pro Lys Lys Asn Lys His Lys Arg Lys 
20 85 90 95 



Lys Val Lys Leu Ala Val Leu Lys Tyr Tyr Lys Val Asp Glu Asn Gly 
100 105 110 

25 

Lys He Ser Arg Leu Arg Arg Glu Cys Pro Ser Asp Glu Cys Gly Ala 
115 120 125 

30 

Gly Val Phe Met Ala Ser His Phe Asp Arg His Tyr Cys Gly Lys Cys 
130 135 140 



35 Cys Leu Thr Tyr Cys Phe Asn Lys Pro Glu Asp Lys 
145 150 155 



<210> 11 

40 

<211> 2608 

<212> DNA 

45 <213> Homo sapiens 

<220> 
50 <221> CDS 

<222> (180) . . (2591) 

55 <400> 11 

agctcgccgg cctttggtct ccaggacttg tcccagcagc ccctcgaact gagaattaca 60 

ccatcggacc cctggctctg aggccttcag acttggactg tgtcacactg ccaggcttcc 120 

60 agggctccaa cttgcagacg gcctgttgtg ggacagtctc tgtaatcgcg aaagcaacc 179 
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atg gaa gac ctg ggg gaa aac acc atg gtt tta tec acc ctg aga tct 227 
Met Glu Asp Leu Gly Glu Asn Thr Met Val Leu Ser Thr Leu Arg Ser 
15 10 is 

ttg aac aac ttc ate tct cag cgt gtg gag gga ggc tct gga ctg gat 275 
Leu Asn Asn Phe lie Ser Gin Arg Val Glu Gly Gly Ser Gly Leu Asp 
20 25 30 

att tct acc teg gec cca ggt tct ctg cag atg cag tac cag cag age 323 
lie Ser Thr Ser Ala Pro Gly Ser Leu Gin Met Gin Tyr Gin Gin Ser 
35 40 45 

. atg cag etc -mHq ,gaa aga gca gag cag ate cgt teg aag tec cae etc s'.-i 
■ £5. Mer. L'>v, »r-£ Ala Glu- Wr- 1 Ilk Tiro Ser t ys cfsrsr'v?.- ' 

50 55 60 

ate cag gtg gag egg gag aaa atg cag atg gag ctg agt cac aag agg 419 
He Gin Val Glu Arg Glu Lys Met Gin Met Glu Leu Ser His Lys Arg 
20 65 70 75 80 

get cga gtg gag ctg gag aga gca gec age acc agt gec agg aac tac 4 67 

Ala Arg Val Glu Leu Glu Arg Ala Ala Ser Thr Ser Ala Arg Asn Tyr 
85 90 95 

25 

gag cgt gag gtc gac cgc aac cag gag etc ctg acg cgc ate egg cag 515 
Glu Arg Glu Val Asp Arg Asn Gin Glu Leu Leu Thr Arg He Arg Gin 
100 105 110 

30 ctt cag gag egg gag gec ggg gcg gag gag aag atg cag gag cag ctg 563 
Leu Gin Glu Arg Glu Ala Gly Ala Glu Glu Lys Met Gin Glu Gin Leu 
115 120 125 

gag cgc aac agg cag tgt cag cag aac ttg gat get gec age aag agg 611 
35 Glu Arg Asn Arg Gin Cys Gin Gin Asn Leu Asp Ala Ala Ser Lys Arg 
130 135 140 

ctg cgt gag aaa gag gac agt ctg gec cag get ggc gag acc ate aac 659 
Leu Arg Glu Lys Glu Asp Ser Leu Ala Gin Ala Gly Glu Thr lie Asn 
40 145 150 155 160 

gca ctg aag ggg agg ate teg gaa ctg cag tgg age gtg atg gac cag 707 

Ala Leu Lys Gly Arg He Ser Glu Leu Gin Trp Ser Val Met Asp Gin 

165 170 175 

45 

gag atg egg gtg aag cgc ctg gag teg gag aag cag gac gtg cag gag 755 

Glu Met Arg Val Lys Arg Leu Glu Ser Glu Lys Gin Asp Val Gin Glu 
180 185 190 

50 cag ctg gac ctg caa cac aaa aaa tgc cag gaa gec aat cag aaa ate 803 
Gin Leu Asp Leu Gin His Lys Lys Cys Gin Glu Ala Asn Gin Lys He 
195 200 205 

cag gaa etc cag gee age caa gaa gca aga gca gac cac gag cag cag 851 
55 Gin Glu Leu Gin Ala Ser Gin Glu Ala Arg Ala Asp His Glu Gin Gin 
210 215 220 

att aag gat ctg gag cag aag ctg tec ctg caa gag cag gat gca gcg 8 99 

He Lys Asp Leu Glu Gin Lys Leu Ser Leu Gin Glu Gin Asp Ala Ala 
60 225 230 235 240 
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10 



att gtg aag aac atg aag tct gag ctg gta egg etc cct agg ctg gaa 947 
He Val Lys Asn Met Lys Ser Glu Leu Val Arg Leu Pro Arg Leu Glu 
245 250 255 

egg gag ctg gag cag ctg egg gag gag age gca ctg egg gag atg aga 995 
Arg Glu Leu Glu Gin Leu Arg Glu Glu Ser Ala Leu Arg Glu Met Arg 
260 265 270 

gag ace aac ggg ctg etc cag gaa gag ctg gaa ggg ctg cag agg aag 1043 
Glu Thr Asn Gly Leu Leu Gin Glu Glu Leu Glu Gly Leu Gin Arg Lys 
275 280 285 



: .ctg >ggg egc *cag gag a?, a aLg cag gag a^q cLg gtt ggc ttg gag ctg 4 ' 4091- 

290 *.j5 300 

gag aac gag agg ctg ctg gec aag ctg caa age tgg gag aga ctg gac 1139 
Glu Asn Glu Arg Leu Leu Ala Lys Leu Gin Ser Trp Glu Arg Leu Asp 
20 305 310 315 320 

cag ace atg ggc ctg age ate agg act cca gaa gac ctt tec aga ttc 1187 

Gin Thr Met Gly Leu Ser He Arg Thr Pro Glu Asp Leu Ser Arg Phe 
325 330 335 

25 

gtg gtt gag ctg cag cag agg gag ctt gec ttg aag gac aag aac age 1235 

Val Val Glu Leu Gin Gin Arg Glu Leu Ala Leu Lys Asp Lys Asn Ser 
340 345 350 

30 gec gtc ace age age gec egg ggg ctg gag aag gee agg cag cag ctg 1283 
Ala Val Thr Ser Ser Ala Arg Gly Leu Glu Lys Ala Arg Gin Gin Leu 
355 360 365 

cag gag gag etc egg cag gtc age ggc cag ctg ttg gag gag agg aag 1331 
35 Gin Glu Glu Leu Arg Gin Val Ser Gly Gin Leu Leu Glu Glu Arg Lys 
370 375 380 

aag cgc gag acc cac gag gcg ctg gec egg agg etc cag aaa egg gtc 1379 
Lys Arg Glu Thr His Glu Ala Leu Ala Arg Arg Leu Gin Lys Arg Val 
40 385 390 395 400 

ctg ctg etc acc aag gag egg gac ggt atg egg gec ate ctg ggg tec 1427 
Leu Leu Leu Thr Lys Glu Arg Asp Gly Met Arg Ala He Leu Gly Ser 
405 410 415 

45 

tac gac age gag ctg acc ccg gee gag tac tea ccc cag ctg acg egg 1475 
Tyr Asp Ser Glu Leu Thr Pro Ala Glu Tyr Ser Pro Gin Leu Thr Arg 
420 425 430 

50 cgc atg egg gag get gag gat atg gtg cag aag gtg cac age cac age 1523 
Arg Met Arg Glu Ala Glu Asp Met Val Gin Lys Val His Ser His Ser 
435 440 445 

gee gag atg gag get cag ctg teg cag gec ctg gag gag ctg gga ggc 1571 
55 Ala Glu Met Glu Ala Gin Leu Ser Gin Ala Leu Glu Glu Leu Gly Gly 
450 455 460 



cag aaa caa aga gca gac atg ctg gag atg gag ctg aag atg ctg aag 
Gin Lys Gin Arg Ala Asp Met Leu Glu Met Glu Leu Lys Met Leu Lys 
60 465 470 475 480 



1619 



WO 01/21794 



PCT/US00/25790 



10 



■22 



tct cag tec age tct gec gaa cag age ttc ctg ttc tec agg gag gag 1667 
Ser Gin Ser Ser Ser Ala Glu Gin Ser Phe Leu Phe Ser Arg Glu Glu 
485 490 495 

gcg gac acg etc agg ttg aag gtc gag gag ctg gaa ggc gag egg agt 1715 
Ala Asp Thr Leu Arg Leu Lys Val Glu Glu Leu Glu Gly Glu Arg Ser 
500 505 510 

egg ctg gag gag gaa aag agg atg ctg gag gca cag ctg gag egg cga 1763 
Arg Leu Glu Glu Glu Lys Arg Met Leu Glu Ala Gin Leu Glu Arg Arg 
515 520 525 

- get ctg ceig ggt .gac tat •gac: cag aqc agg see aaa ttg ^tg cac atg ' 

V: -^u Gin" 'G> >? Asij Tyr . fts** nxq Thr .\'-"L ;-_,*-«> - J^vf'"""- 

S"-V. 535 540 

age ctg aac ccc acc agt gtg gec agg cag cgc ctg cgc gag gac cac 1859 
Ser Leu Asn Pro Thr Ser Val Ala Arg Gin Arg Leu Arg Glu Asp His 
20 545 550 555 560 

age cag ctg cag gcg gag tgc gag cga ctg cgc ggg etc ctg cgc gec 1907 
Ser Gin Leu Gin Ala Glu Cys Glu Arg Leu Arg Gly Leu Leu Arg Ala 
565 570 575 

25 

atg gag aga gga ggc acc gtc cca gec gac ctt gag get gec gee gcg 1955 
Met Glu Arg Gly Gly Thr Val Pro Ala Asp Leu Glu Ala Ala Ala Ala 
580 585 590 

30 agt ctg cca teg tec aag gag gtg gca gag ctg aag aag cag gtg gag 2003 
Ser Le,u Pro Ser Ser Lys Glu Val Ala Glu Leu Lys Lys Gin Val Glu 
595 600 605 

agt gec gag ctg aag aac cag egg etc aag gag gtt ttc cag acc aag 2051 
35 Ser Ala Glu Leu Lys Asn Gin Arg Leu Lys Glu Val Phe Gin Thr Lys 
610 615 620 

ate cag gag ttc cgc aag gec tgc tac acg etc acc ggc tac cag ate 2099 
lie Gin Glu Phe Arg Lys Ala Cys Tyr Thr Leu Thr Gly Tyr Gin lie 
40 625 630 635 640 

gac ate acc acg gag aac cag tac egg ctg acc teg ctg tac gee gag 2147 

Asp lie Thr Thr Glu Asn Gin Tyr Arg Leu Thr Ser Leu Tyr Ala Glu 

645 650 655 

45 

cac cca ggc gac tgc tea tct tea agg cca cca gec cct egg gtt cca 2195 

His Pro Gly Asp Cys Ser Ser Ser Arg Pro Pro Ala Pro Arg Val Pro 
660 665 670 

50 aga tgc age tac tgg aga cag agt tct cac aca ccg tgg gcg age tea 2243 
Arg Cys Ser Tyr Trp Arg Gin Ser Ser His Thr Pro Trp Ala Ser Ser 
675 680 685 

teg agg tgc acc tgc ggc gee agg aca gca tec ctg cct tec tea get 2291 
55 Ser Arg Cys Thr Cys Gly Ala Arg Thr Ala Ser Leu Pro Ser Ser Ala 
690 695 700 

cgc tea ccc teg age tct tea gec gee aga ccg tgg cgt age ctg cag 2339 
Arg Ser Pro Ser Ser Ser Ser Ala Ala Arg Pro Trp Arg Ser Leu Gin 
60 705 710 715 720 
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gct egg ggg cat age egg age cac tct get tgg cct gac ctg cag gtc 2387 

Ala Arg Gly His Ser Arg Ser His Ser Ala Trp Pro Asp Leu Gin Val 
725 730 735 

5 

ccc tgc ccc gec age cac agg ctg ggt gca cgt cct gec tct cca gee 2435 

Pro Cys Pro Ala Ser His Arg Leu Gly Ala Arg Pro Ala Ser Pro Ala 
740 745 750 

10 cca cag ggc age age atg act gac aga cac get ggg acc tac gtc ggg 2483 
Pro Gin Gly Ser Ser Met Thr Asp Arg His Ala Gly Thr Tyr Val Gly 
755 760 765 



* r r 



cUt cot gct^ggg fi g£?g ^cc^ge acctctc^ticc.acr: tgc aga ^cc cat gca — 2531 

?*ro Ala ^y..a4a Ala ;>er Thr Len -Thr ■ -Cvs; i\rg Pre £la- 

/70 775 ?£u 

tec egg age ctg gtg tgt ggg cgt egg cca cca gec tgg gtt cct cac 2579 

Ser Arg Ser Leu Val Cys Gly Arg Arg Pro Pro Ala Trp Val Pro His 

20 785 790 795 800 

ctt gtg aaa taa aatcttctcc cctaaaa 2608 
Leu Val Lys 



25 



30 



35 



40 



45 



<210> 12 

<211> 803 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Met Glu Asp Leu Gly Glu Asn Thr Met Val Leu Ser Thr Leu Arg Ser 
15 10 15 



Leu Asn Asn Phe lie Ser Gin Arg Val Glu Gly Gly Ser Gly Leu Asp 
20 25 30 



lie Ser Thr Ser Ala Pro Gly Ser Leu Gin Met Gin Tyr Gin Gin Ser 
35 40 45 



50 Met Gin Leu Glu Glu Arg Ala Glu Gin He Arg Ser Lys Ser His Leu 
50 55 60 



He Gin Val Glu Arg Glu Lys Met Gin Met Glu Leu Ser His Lys Arg 
55 65 70 75 80 



Ala Arg Val Glu Leu Glu Arg Ala Ala Ser Thr Ser Ala Arg Asn Tyr 
85 90 95 

60 
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Glu Arg Glu Val Asp Arg Asn Gin Glu Leu Leu Thr Arg lie Arg Gin 
100 105 HO 

5 

Leu Gin Glu Arg Glu Ala Gly Ala Glu Glu Lys Met Gin Glu Gin Leu 
115 120 125 

10 Glu Arg Asn Arg Gin Cys Gin Gin Asn Leu Asp Ala Ala Ser Lys Arg 
130 135 140 



Giu Lys £±u Asp/, Ser .Leu Ala : Gin Ala Gly Glu Tr.; XI- 

- - ' ' - ir>o .>.v : .'• ■ . . - .: ibo 

Ala Leu Lys Gly Arg He Ser Glu Leu Gin Trp Ser Val Met Asp Gin 
165 170 175 

20 

Glu Met Arg Val Lys Arg Leu Glu Ser Glu Lys Gin Asp Val Gin Glu 
180 185 190 

25 

Gin Leu Asp Leu Gin His Lys Lys Cys Gin Glu Ala Asn Gin Lys He 
195 200 205 



30 Gin Glu Leu Gin Ala Ser Gin Glu Ala Arg Ala Asp His Glu Gin Gin 
210 215 220 



He Lys Asp Leu Glu Gin Lys Leu Ser Leu Gin Glu Gin Asp Ala Ala 
35 225 230 235 240 



He Val Lys Asn Met Lys Ser Glu Leu Val Arg Leu Pro Arg Leu Glu 
245 250 255 

40 

Arg Glu Leu Glu Gin Leu Arg Glu Glu Ser Ala Leu Arg Glu Met Arg 
260 265 270 

45 

Glu Thr Asn Gly Leu Leu Gin Glu Glu Leu Glu Gly Leu Gin Arg Lys 
275 280 285 



50 Leu Gly Arg Gin Glu Lys Met Gin Glu Thr Leu Val Gly Leu Glu Leu 
290 295 300 



Glu Asn Glu Arg Leu Leu Ala Lys Leu Gin Ser Trp Glu Arg Leu Asp 
55 305 310 315 320 



Gin Thr Met Gly Leu Ser He Arg Thr Pro Glu Asp Leu Ser Arg Phe 
325 330 335 

60 
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Val Val Glu Leu Gin Gin Arg Glu Leu Ala Leu Lys Asp Lys Asn Ser 
340 345 350 



Ala Val Thr Ser Ser Ala Arg Gly Leu Glu Lys Ala Arg Gin Gin Leu 
355 360 365 



10 Gin Glu Glu Leu Arg Gin Val Ser Gly Gin Leu Leu Glu Glu Arg Lys 
370 375 380 



:W.Lys.> Arg. u Thr Hia Glu Aia Leu Ala Arg Arg Leu Gin Lys Mg Vai 

^ ^ 3SG . - 395 OC 



20 



Leu Leu Leu Thr Lys Glu Arg Asp Gly Met Arg Ala lie Leu Gly Ser 
405 410 415 



25 



Tyr Asp Ser Glu Leu Thr Pro Ala Glu Tyr Ser Pro Gin Leu Thr Arg 
420 425 430 



Arg Met Arg Glu Ala Glu Asp Met Val Gin Lys Val His Ser His Ser 
435 440 445 



30 Ala Glu Met Glu Ala Gin Leu Ser Gin Ala Leu Glu Glu Leu Gly Gly 
450 455 460 



Gin Lys Gin Arg Ala Asp Met Leu Glu Met Glu Leu Lys Met Leu Lys 
35 465 470 475 480 



40 



Ser Gin Ser Ser Ser Ala Glu Gin Ser Phe Leu Phe Ser Arg Glu Glu 
485 490 495 



45 



Ala Asp Thr Leu Arg Leu Lys Val Glu Glu Leu Glu Gly Glu Arg Ser 
500 505 510 



Arg Leu Glu Glu Glu Lys Arg Met Leu Glu Ala Gin Leu Glu Arg Arg 
515 520 525 



50 Ala Leu Gin Gly Asp Tyr Asp Gin Ser Arg Thr Lys Val Leu His Met 
530 535 540 



Ser Leu Asn Pro Thr Ser Val Ala Arg Gin Arg Leu Arg Glu Asp His 
55 545 550 555 560 



60 



Ser Gin Leu Gin Ala Glu Cys Glu Arg Leu Arg Gly Leu Leu Arg Ala 
565 570 575 
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Met Glu Arg Gly Gly Thr Val Pro Ala Asp Leu Glu Ala Ala Ala Ala 
580 585 590 

5 

Ser Leu Pro Ser Ser Lys Glu Val Ala Glu Leu Lys Lys Gin Val Glu 
595 600 605 



10 Ser Ala Glu Leu Lys Asn Gin Arg Leu Lys Glu Val Phe Gin Thr Lys 
610 615 620 



. ^ie :Gln .Glu >Phe Arc* .Lys Ala Cys Tyr Thr Leu Thr Gly xyr Gin lie ^ 



Asp He Thr Thr Glu Asn Gin Tyr Arg Leu Thr Ser Leu Tyr Ala Glu 
645 650 655 

20 

His Pro Gly Asp Cys Ser Ser Ser Arg Pro Pro Ala Pro Arg Val Pro 
660 665 670 

25 

Arg Cys Ser Tyr Trp Arg Gin Ser Ser His Thr Pro Trp Ala Ser Ser 
675 680 685 



30 Ser Arg Cys Thr Cys Gly Ala Arg Thr Ala Ser Leu Pro Ser Ser Ala 
690 695 700 



Arg Ser Pro Ser Ser Ser Ser Ala Ala Arg Pro Trp Arg Ser Leu Gin 
35 705 710 715 720 



Ala Arg Gly His Ser Arg Ser His Ser Ala Trp Pro Asp Leu Gin Val 
725 730 735 

40 

Pro Cys Pro Ala Ser His Arg Leu Gly Ala Arg Pro Ala Ser Pro Ala 
740 745 750 

45 

Pro Gin Gly Ser Ser Met Thr Asp Arg His Ala Gly Thr Tyr Val Gly 
755 760 765 



50 Leu Pro Ala Gly Ala Ala Ser Thr Leu Ser Thr Cys Arg Pro His Ala 
770 775 780 



Ser Arg Ser Leu Val Cys Gly Arg Arg Pro Pro Ala Trp Val Pro His 
55 785 790 795 800 



Leu Val Lys 

60 
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<210> 13 

<211> 2965 

<212> DNA 

<213> Homo sapiens 

<220> 



<221> CDS 
?22> (XT: . . ;;:v; ;•, 



- <40O> 13 

gcggaggtcg gcggtcgggt ccgtctctgc ccgcggctgt ggcggcgccg gcggatccag 60 

ccttagcgtt cctctctggg cggcggcggc ggcggctcgg ttgacgcctc ctccgccagc 120 

tgagcccgcg ggagcccagg acgccgcttc cccgcccatc cccgctcccc gaggccggcc 180 

gcctggtc atg gcg cag ccg ggc ccg get tec cag cct gac gtt tct ctt 230 
Met Ala Gin Pro Gly Pro Ala Ser Gin Pro Asp Val Ser Leu 
15 10 

cag caa egg gta gca gaa ttg gaa aaa att aat gca gaa ttt tta cgt 278 
Gin Gin Arg Val Ala Glu Leu Glu Lys lie Asn Ala Glu Phe Leu Arg 
15 20 25 30 

gca caa cag cag ctt gaa caa gaa ttt aat caa aag aga gca aaa ttt 326 
Ala Gin Gin Gin Leu Glu Gin Glu Phe Asn Gin Lys Arg Ala Lys Phe 
35 40 45 

aag gag tta tat ttg get aaa gag gag gat ctg aag agg caa aat gca 374 
Lys Glu Leu Tyr Leu Ala Lys Glu Glu Asp Leu Lys Arg Gin Asn Ala 
50 55 60 

gta tta caa get gca caa gat gat ttg gga cac ctt cga acc cag ctg 422 
Val Leu Gin Ala Ala Gin Asp Asp Leu Gly His Leu Arg Thr Gin Leu 
65 70 75 

tgg gaa get caa gca gag atg gag aat att aag gcg att gec aca gtc 470 
Trp Glu Ala Gin Ala Glu Met Glu Asn He Lys Ala He Ala Thr Val 
80 85 90 

tct gag aac acc aag caa gaa get ata gat gaa gtg aaa aga cag tgg 518 
Ser Glu Asn Thr Lys Gin Glu Ala He Asp Glu Val Lys Arg Gin Trp 
.95 100 105 110 

aga gaa gaa gtt get tea ctt cag get gtt atg aaa gaa aca gtt cgt 566 
Arg Glu Glu Val Ala Ser Leu Gin Ala Val Met Lys Glu Thr Val Arg 
115 120 125 

gac tat gag cac cag ttc cac ctt agg ctg gag cag gag cga aca cag 614 
Asp Tyr Glu His Gin Phe His Leu Arg Leu Glu Gin Glu Arg Thr Gin 
130 135 140 
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tgg gca cag tat aga gaa tac gca gag agg gaa ata get gat tta aga 662 
Trp Ala Gin Tyr Arg Glu Tyr Ala Glu Arg Glu He Ala Asp Leu Arg 
145 150 155 

5 aga agg ctg tct gaa ggt caa gag gag gaa aat tta gaa aat gaa atg 710 
Arg Arg Leu Ser Glu Gly Gin Glu Glu Glu Asn Leu Glu Asn Glu Met 
160 165 170 

aaa aag gec caa gag gat get gag aaa ctt egg tec gtt gtg atg cca 758 
10 Lys Lys Ala Gin Glu Asp Ala Glu Lys Leu Arg Ser Val Val Met Pro 
175 180 185 190 

ata gaa aag gaa att gca get ttg aag gat aaa ctg aca gaq get gaa 806 

. bte t C I u Lys ,Glu , lie Ala Ala Jjeu Lys Asp ,L~j £ T..au Tnx «lu A.Ta Glu 'r * 
15 ' . 195 - ■ ' ^00^" . - ■ si)5 '.' " 

gac aaa att aaa gag ctg gag gec tea aag gtt aaa gaa ctg aat cat 854 

Asp Lys He Lys Glu Leu Glu Ala Ser Lys Val Lys Glu Leu Asn His 

210 215 220 

20 

tat ctg gaa get gag aaa tct tgt agg act gat eta gag atg tat gta 902 

Tyr Leu Glu Ala Glu Lys Ser Cys Arg Thr Asp Leu Glu Met Tyr Val 

225 230 235 

25 get gtt ttg aat act cag aaa tct gtt eta cag gaa gat get gag aaa 950 
Ala Val Leu Asn Thr Gin Lys Ser Val Leu Gin Glu Asp Ala Glu Lys 
240 245 250 

ctg egg aaa gaa ttg cat gaa gtt tgc cat etc ttg gag caa gag cga 998 
30 Leu Arg Lys Glu Leu His Glu Val Cys His Leu Leu Glu Gin Glu Arg 

255 260 265 270 ' 

caa caa cac aac cag tta aaa cat acg tgg cag aag gee aat gac cag 1046 
Gin Gin His Asn Gin Leu Lys His Thr Trp Gin Lys Ala Asn Asp Gin 
35 275 280 285 

ttt ctg gaa tct cag cgt tta ctg atg aga gac atg cag cga atg gag 1094 

Phe Leu Glu Ser Gin Arg Leu Leu Met Arg Asp Met Gin Arg Met Glu 
290 295 300 

40 

att gtg eta act tea gaa cag etc cga caa gtt gaa gaa ctg aag aag 1142 

lie Val Leu Thr Ser Glu Gin Leu Arg Gin Val Glu Glu Leu Lys Lys 
305 310 315 

45 aaa gat cag gag gat gat gaa caa caa aga etc aat aag aga aag gat 1190 
Lys Asp Gin Glu Asp Asp Glu Gin Gin Arg Leu Asn Lys Arg Lys Asp 
320 325 330 

cac aaa aaa gca gat gtt gag gaa gaa ata aaa ata cca gta gtg tgt 1238 
50 His Lys Lys Ala Asp Val Glu Glu Glu He Lys He Pro Val Val Cys 
335 340 345 350 

get tta act caa gaa gaa tct tea gee cag tta tea aat gaa gag gag 1286 
Ala Leu Thr Gin Glu Glu Ser Ser Ala Gin Leu Ser Asn Glu Glu Glu 
55 355 360 365 

cat tta gac age acc cgt ggc tea gtt cat tec tta gat gca ggc ttg 1334 
His Leu Asp Ser Thr Arg Gly Ser Val His Ser Leu Asp Ala Gly Leu 
370 375 380 

60 
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10 



20 



25 



ctg ttg cca tct gga gat cct ttc agt aaa teg gac aat gac atg ttt 1382 
Leu Leu Pro Ser Gly Asp Pro Phe Ser Lys Ser Asp Asn Asp Met Phe 
385 390 395 

aaa gat gga etc agg aga gca cag tct aca gac age ttg gga acc teg 1430 
Lys Asp Gly Leu Arg Arg Ala Gin Ser Thr Asp Ser Leu Gly Thr Ser 
400 405 410 



1478 



ggc tea ttg caa tec aaa get tta ggc tat aac tac aaa gca aaa tct 
Gly Ser Leu Gin Ser Lys Ala Leu Gly Tyr Asn Tyr Lys Ala Lys Ser 
415 420 425 430 

ryct gga aac ctg gac gag tr.a gat ttt gga cca ctg gt?„ gga gca gat 
.-Vis Gly Asii Leu Asp Glu Ser Asp hie Gly Pro Leu Val GJLy ;ila Aay 
■ '••'^ : "' 4 4 •. * ' .• • 4 1 

tea gtg tct gag aac ttt gat act gca tec ctt ggg tea etc cag atg 1574 
Ser Val Ser Glu Asn Phe Asp Thr Ala Ser Leu Gly Ser Leu Gin Met 
450 455 460 

cca agt ggg ttt atg tta acc aaa gat cag gaa aga gca ate aag gcg 1622 
Pro Ser Gly Phe Met Leu Thr Lys Asp Gin Glu Arg Ala lie Lys Ala 
465 470 475 

atg aca cca gaa caa gaa gag aca gcg tec etc etc tec age gtt acc 1670 
Met Thr Pro Glu Gin Glu Glu Thr Ala Ser Leu Leu Ser Ser Val Thr 
480 485 490 



cag ggc atg gag agt gec tat gtg tec cct agt ggt tat cgt tta gtt 1718 
30 Gin Gly Met Glu Ser Ala Tyr Val Ser Pro Ser Gly Tyr Arg Leu Val 
495 500 505 510 

agt gaa aca gaa tgg aat etc ttg cag aaa gag gta cat aat get gga 1766 
Ser Glu Thr Glu Trp Asn Leu Leu Gin Lys Glu Val His Asn Ala Gly 
35 515 520 525 

aat aaa ctt ggt aga cgt tgt gat atg tgt tec aat tac gaa aaa cag 1814 

Asn Lys Leu Gly Arg Arg Cys Asp Met Cys Ser Asn Tyr Glu Lys Gin 

530 535 540 

40 

tta caa gga att cag att cag gag get gaa acg aga gac cag gtg aaa 1862 

Leu Gin Gly He Gin He Gin Glu Ala Glu Thr Arg Asp Gin Val Lys 

545 550 555 

45 aaa eta cag ctg atg eta agg caa get aat gac cag tta gag aag aca 1910 
Lys Leu Gin Leu Met Leu Arg Gin Ala Asn Asp Gin Leu Glu Lys Thr 
560 565 570 

atg aaa gat aag cag gag ctg gaa gac ttc ata aag caa age age gaa 1958 
50 Met Lys Asp Lys Gin Glu Leu Glu Asp Phe He Lys Gin Ser Ser Glu 
575 580 585 590 

gat teg agt cac cag ate tct gca etc gtc eta aga gec cag gee tec 2006 
Asp Ser Ser His Gin He Ser Ala Leu Val Leu Arg Ala Gin Ala Ser 
55 595 600 605 

gag ate tta ctt gaa gag tta cag cag ggg ctt tec cag gca aag agg 2054 
Glu He Leu Leu Glu Glu Leu Gin Gin Gly Leu Ser Gin Ala Lys Arg 



60 



610 615 620 
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gat gtt cag gaa cag atg gcg gtg ctg atg cag tea egg gaa cag gtt 2102 
Asp Val Gin Glu Gin Met Ala Val Leu Met Gin Ser Arg Glu Gin Val 
625 630 635 

5 tea gaa gag ctg gtg agg tta cag aaa gat aat gac agt etc cag gga 2150 
Ser Glu Glu Leu Val Arg Leu Gin Lys Asp Asn Asp Ser Leu Gin Gly 
640 645 650 

aag cac age ctg cat gtg tea tta cag caa gca gaa gac ttc ate etc 2198 
10 Lys His Ser Leu His Val Ser Leu Gin Gin Ala Glu Asp Phe He Leu 
655 660 665 670 

era gac act aca gar gcr» ctg ~rtg gag ttcj qtc. tta'ae?; t : L gi" g=»g 2246 
: < Pro Asp -Thr Thr Griu Alr». l»bo Arg Glu Leu Val Leu- Lys Tyr Arq Gi ? 

>-> • ■ s'rj ' cab- • * 

gac ate att aat gtg egg aca gca gca gac cac gta gaa gaa aag ctg 2294 

Asp He He Asn Val Arg Thr Ala Ala Asp His Val Glu Glu Lys Leu 

690 695 700 

20 

aag get gag at a ctt ttc eta aaa gag cag ate caa gca gaa cag tgt 2342 

Lys Ala Glu He Leu Phe Leu Lys Glu Gin He Gin Ala Glu Gin Cys 
705 710 715 

25 tta aaa gaa aat ctt gaa gaa act ctg caa eta gaa ata gaa aac tgc 2390 
Leu Lys Glu Asn Leu Glu Glu Thr Leu Gin Leu Glu He Glu Asn Cys 
720 725 730 

aag gag gaa ata get tct att tct age eta aaa get gaa tta gaa aga 2438 
30 Lys Glu Glu He Ala Ser He Ser Ser Leu Lys Ala Glu Leu Glu Arg 
735 740 745 750 

ata aaa gtg gaa aaa gga cag ttg gag tec aca tta aga gag aag tct 24 86 
He Lys Val Glu Lys Gly Gin Leu Glu Ser Thr Leu Arg Glu Lys Ser 
35 755 ' 760 765 

caa cag ctt gag agt ctt cag gaa ata aag ate agt ttg gaa gag cag 2534 

Gin Gin Leu Glu Ser Leu Gin Glu He Lys He Ser Leu Glu Glu Gin 
770 775 780 

40 

tta aag aaa gag act get get aag get ace gtt gaa cag eta atg ttt 2582 

Leu Lys Lys Glu Thr Ala Ala Lys Ala Thr Val Glu Gin Leu Met Phe 
785 790 795 

45 gaa gag aag aac aaa get cag aga tta cag aca gaa tta gat gtc agt 2630 
Glu Glu Lys Asn Lys Ala Gin Arg Leu Gin Thr Glu Leu Asp Val Ser 
800 805 810 

gag caa gtc cag aga gat ttt gta aag ctt tea cag acc ctt cag gtg 2678 
50 Glu Gin Val Gin Arg Asp Phe Val Lys Leu Ser Gin Thr Leu Gin Val 
815 820 825 830 

cag tta gag egg ate egg caa get gac tec ttg gag aga ate egg gca 2726 
Gin Leu Glu Arg He Arg Gin Ala Asp Ser Leu Glu Arg He Arg Ala 
55 835 840 845 

att ctg aat gat act aaa ctg aca gac att aac cag ctt cct gag aca 2774 

He Leu Asn Asp Thr Lys Leu Thr Asp He Asn Gin Leu Pro Glu Thr 
850 855 860 

60 
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tga caccctcatg gcaggattct agcctgcact ttgggttttt aactcatctt 2827 

tagagcaaca gtaattatta tttaactctt aactgaagaa agagaagtca caacaaaagg 2887 

5 aagactggag aaatgcttac ttctagaggg agaagactgt gcggcacagg aaacagcaaa 2947 

cagtggggtg atctgcag 2965 

10 <210> 14 
<211> 862 

-V. <212> ! PRT; > - - V ■ .■■ i/:fv.^.. ...Kwyv: - v : , . ^ 

<213> Homo sapiens 
<400> 14 

20 

Met Ala Gin Pro Gly Pro Ala Ser Gin Pro Asp Val Ser Leu Gin Gin 
15 10 15 

25 Arg Val Ala Glu Leu Glu Lys lie Asn Ala Glu Phe Leu Arg Ala Gin 
20 25 30 

Gin Gin Leu Glu Gin Glu Phe Asn Gin Lys Arg Ala Lys Phe Lys Glu 
30 35 40 45 

Leu Tyr Leu Ala Lys Glu Glu Asp Leu Lys Arg Gin Asn Ala Val Leu 
50 55 60 

35 

Gin Ala Ala Gin Asp Asp Leu Gly His Leu Arg Thr Gin Leu Trp Glu 
65 70 75 80 

40 

Ala Gin Ala Glu Met Glu Asn lie Lys Ala lie Ala Thr Val Ser Glu 
85 90 95 

45 Asn Thr Lys Gin Glu Ala lie Asp Glu Val Lys Arg Gin Trp Arg Glu 
100 105 110 

Glu Val Ala Ser Leu Gin Ala Val Met Lys Glu Thr Val Arg Asp Tyr 
50 115 120 125 



Glu His Gin Phe His Leu Arg Leu Glu Gin Glu Arg Thr Gin Trp Ala 
130 135 140 

55 

Gin Tyr Arg Glu Tyr Ala Glu Arg Glu lie Ala Asp Leu Arg Arg Arg 
145 150 155 160 



60 
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Leu Ser Glu Gly Gin Glu Glu Glu Asn Leu Glu Asn Glu Met Lys Lys 
165 170 175 

5 Ala Gin Glu Asp Ala Glu Lys Leu Arg Ser Val Val Met Pro Met Glu 
180 185 190 



Lys Glu lie Ala Ala Leu Lys Asp Lys Leu Thr Glu Ala Glu Asp Lys 
10 195 200 205 



T?e Lys G. 1 v Leu Glu Ala Ser Ly.«r Val L/s Glu Leu *-3r. V J ? i,^c 
'210 • . .., > ■ ■ 215 •• , /' 220 ' ' " ~ ^ 



Glu Ala Glu Lys Ser Cys Arg Thr Asp Leu Glu Met Tyr Val Ala Val 
225 230 235 240 



20 



Leu Asn Thr Gin Lys Ser Val Leu Gin Glu Asp Ala Glu Lys Leu Arg 
245 250 255 



25 Lys Glu Leu His Glu Val Cys His Leu Leu Glu Gin Glu Arg Gin Gin 
260 265 270 



His Asn Gin Leu Lys His Thr Trp Gin Lys Ala Asn Asp Gin Phe Leu 
30 275 280 285 



35 



Glu Ser Gin Arg Leu Leu Met Arg Asp Met Gin Arg Met Glu lie Val 
290 295 300 



40 



Leu Thr Ser Glu Gin Leu Arg Gin Val Glu Glu Leu Lys Lys Lys Asp 
305 310 315 320 



Gin Glu Asp Asp Glu Gin Gin Arg Leu Asn Lys Arg Lys Asp His Lys 
325 330 335 



45 Lys Ala Asp Val Glu Glu Glu He Lys He Pro Val Val Cys Ala Leu 
340 345 350 



Thr Gin Glu Glu Ser Ser Ala Gin Leu Ser Asn Glu Glu Glu His Leu 
50 355 360 365 



Asp Ser Thr Arg Gly Ser Val His Ser Leu Asp Ala Gly Leu Leu Leu 
370 375 380 

55 



Pro Ser Gly Asp Pro Phe Ser Lys Ser Asp Asn Asp Met Phe Lys Asp 
385 390 395 400 



60 
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Gly Leu Arg Arg Ala Gin Ser Thr Asp Ser Leu Gly Thr Ser Gly Ser 
405 410 415 

5 Leu Gin Ser Lys Ala Leu Gly Tyr Asn Tyr Lys Ala Lys Ser Ala Gly 
420 425 430 



Asn Leu Asp Glu Ser Asp Phe Gly Pro Leu Val Gly Ala Asp Ser Val 
10 435 440 445 



Ser . Glr. .Asn Phe Asp Thr Ala Ser T>ev Glv Ser Leu Gin Mp*- n r r. ^or 
450 455 4SC ' -."•/• 



Gly Phe Met Leu Thr Lys Asp Gin Glu Arg Ala lie Lys Ala Met Thr 
465 470 475 480 

20 

Pro Glu Gin Glu Glu Thr Ala Ser Leu Leu Ser Ser Val Thr Gin Gly 
485 490 495 



25 Met Glu Ser Ala Tyr Val Ser Pro Ser Gly Tyr Arg Leu Val Ser Glu 
500 505 510 



Thr Glu Trp Asn Leu Leu Gin Lys Glu Val His Asn Ala Gly Asn Lys 
30 515 520 525 



Leu Gly Arg Arg Cys Asp Met Cys Ser Asn Tyr Glu Lys Gin Leu Gin 
530 535 540 

35 

Gly lie Gin He Gin Glu Ala Glu Thr Arg Asp Gin Val Lys Lys Leu 
545 550 555 . 560 

40 

Gin Leu Met Leu Arg Gin Ala Asn Asp Gin Leu Glu Lys Thr Met Lys 
565 570 575 



45 Asp Lys Gin Glu Leu Glu Asp Phe He Lys Gin Ser Ser Glu Asp Ser 
580 585 590 



Ser His Gin He Ser Ala Leu Val Leu Arg Ala Gin Ala Ser Glu He 
50 595 600 605 



Leu Leu Glu Glu Leu Gin Gin Gly Leu Ser Gin Ala Lys Arg Asp Val 
610 615 620 

55 

Gin Glu Gin Met Ala Val Leu Met Gin Ser Arg Glu Gin Val Ser Glu 
625 630 635 640 



60 
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Glu Leu Val Arg Leu Gin Lys Asp Asn Asp Ser Leu Gin Gly Lys His 
645 650 655 

5 Ser Leu His Val Ser Leu Gin Gin Ala Glu Asp Phe lie Leu Pro Asp 
660 665 670 

Thr Thr Glu Ala Leu Arg Glu Leu Val Leu Lys Tyr Arg Glu Asp He 
10 675 680 685 

He Asr> ,Val Arc? T^r Ala A.la Asp wis V*l .Glu Glu Leu hys A 1 - 

690 - . 69b . 700 

Glu He Leu Phe Leu Lys Glu Gin He Gin Ala Glu Gin Cys Leu Lys 
705 710 715 720 

20 

Glu Asn Leu Glu Glu Thr Leu Gin Leu Glu lie Glu Asn Cys Lys Glu 
725 730 735 

25 Glu He Ala Ser He Ser Ser Leu Lys Ala Glu Leu Glu Arg He Lys 
740 745 750 

Val Glu Lys Gly Gin Leu Glu Ser Thr Leu Arg Glu Lys Ser Gin Gin 
30 755 760 765 



Leu Glu Ser Leu Gin Glu He Lys He Ser Leu Glu Glu Gin Leu Lys 
770 775 780 

35 

Lys Glu Thr Ala Ala Lys Ala Thr Val Glu Gin Leu Met Phe Glu Glu 
785 790 795 800 

40 

Lys Asn Lys Ala Gin Arg Leu Gin Thr Glu Leu Asp Val Ser Glu Gin 
805 810 815 



45 Val Gin Arg Asp Phe Val Lys Leu Ser Gin Thr Leu Gin Val Gin Leu 
820 825 830 



Glu Arg He Arg Gin Ala Asp Ser Leu Glu Arg He Arg Ala He Leu 
50 835 840 845 



Asn Asp Thr Lys Leu Thr Asp He Asn Gin Leu Pro Glu Thr 
850 855 860 

55 

<210> 15 



<2H> 1330 

60 
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35 



<212> DNA 

<213> Homo sapiens 

5 

<220> 

<221> CDS 

10 <222> (145).. (1278) 



20 



<400> 15 

aattcccaa^ tqi*ccti-tta tt Lcat&cag. agatacaaag gcaactatgt gcagcaacaa 

tctgatgggc a^ccaaact cttgggagga agtaoa^* tggtaaatgt catgatggcg 

gtcgggaggg aggaaggtgg caag atg gtg ttg gaa age act atg gtg tgt 

Met Val Leu Glu Ser Thr Met Val Cys 
1 5 



^20 
171 



25 



gtg gac aac agt gag tat atg egg aat gga gac ttc tta ccc acc agg 

Val Asp Asn Ser Glu Tyr Met Arg Asn Gly Asp Phe Leu Pro Thr Arg 
10 15 20 25 

ctg cag gec cag cag gat get gtc aac ata gtt tgt cat tea aag acc 

Leu Gin Ala Gin Gin Asp Ala Val Asn He Val Cys His Ser Lys Thr 
30 35 40 



219 



267 



30 cgc age aac cct gag aac aac gtg ggc ctt ate aca ctg get aat gac 
Arg Ser Asn Pro Glu Asn Asn Val Gly Leu He Thr Leu Ala Asn Asp 
45 50 55 



315 



tgt gaa gtg ctg acc aca etc acc cca gac act ggc cgt ate ctg tec 
35 Cys Glu Val Leu Thr Thr Leu Thr Pro Asp Thr Gly Arg He Leu Ser 
60 65 70 



363 



40 



aag eta cat act gtc caa ccc aag ggc aag ate acc ttc tgc acg ggc 
Lys Leu His Thr Val Gin Pro Lys Gly Lys He Thr Phe Cys Thr Gly 
75 80 85 



411 



45 



ate cgc gtg gee cat ctg get ctg aag cac cga caa ggc aag aat cac 459 
He Arg Val Ala His Leu Ala Leu Lys His Arg Gin Gly Lys Asn His 
90 95 100 105 

aag atg cgc ate att gec ttt gtg gga age cca gtg gag gac aat gag 507 
Lys Met Arg He He Ala Phe Val Gly Ser Pro Val Glu Asp Asn Glu 
110 115 120 



50 aag gat ctg gtg aaa ctg get aaa cgc etc aag aag gag aaa gta aat 
Lys Asp Leu Val Lys Leu Ala Lys Arg Leu Lys Lys Glu Lys Val Asn 
125 130 135 



555 



gtt gac att ate aat ttt ggg gaa gag gag gtg aac aca gaa aag ctg 
55 Val Asp He He Asn Phe Gly Glu Glu Glu Val Asn Thr Glu Lys Leu 
140 145 150 



603 



60 



aca gec ttt gta aac acg ttg aat ggc aaa gat gga acc ggt tct cat 
Thr Ala Phe Val Asn Thr Leu Asn Gly Lys Asp Gly Thr Gly Ser His 
155 160 165 



651 
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ctg gtg aca gtg cct cct ggg ccc agt ttg get gat get etc ate agt 699 

Leu Val Thr Val Pro Pro Gly Pro Ser Leu Ala Asp Ala Leu He Ser 

170 175 190 185 

5 

tct ccg att ttg get ggt gaa ggt ggt gec atg ctg ggt ctt ggt gee 747 

Ser Pro He Leu Ala Gly Glu Gly Gly Ala Met Leu Gly Leu Gly Ala 
190 195 200 

10 agt gac ttt gaa ttt gga gta gat ccc agt get gat cct gag ctg gee 795 
Ser Asp Phe Glu Phe Gly Val Asp Pro Ser Ala Asp Pro Glu Leu Ala 
205 210 215 

u >i t^ittg^-gcclJct.ti -rCcri. gta t^t atg q>a g^g cag egg cag cggHcaq fag $a<j 3 « ? 

■ > ' Leu AXa :-e ; .: 5 Val be? ™et Gin Arc ^J.u rv— 

220 225 " 230 

gag gee egg egg gca get gca get tct get get gag gec ggg att get 891 
Glu Ala Arg Arg Ala Ala Ala Ala Ser Ala Ala Glu Ala Gly He Ala 
20 235 240 245 

acg act ggg act gaa gac tea gac gat gec ctg ctg aag atg acc ate 939 

Thr Thr Gly Thr Glu Asp Ser Asp Asp Ala Leu Leu Lys Met Thr He 
250 255 260 265 

25 

age cag caa gag ttt ggc cgc act ggg ctt cct gac eta age agt atg 987 

Ser Gin Gin Glu Phe Gly Arg Thr Gly Leu Pro Asp Leu Ser Ser Met 
270 275 280 

30 act gag gaa gag cag att get tat gec atg cag atg tec ctg cag gga 1035 
Thr Glu Glu Glu Gin He Ala Tyr Ala Met Gin Met Ser Leu Gin Gly 
285 290 295 

gca gag ttt ggc cag gcg gaa tea gca gac att gat gee age tea get 1083 
35 Ala Glu Phe Gly Gin Ala Glu Ser Ala Asp He Asp Ala Ser Ser Ala 
300 305 310 

atg gac aca tec gag cca gec aag gag gag gat gat tac gac gtg atg 1131 
Met Asp Thr Ser Glu Pro Ala Lys Glu Glu Asp Asp Tyr Asp Val Met 
40 315 320 325 

cag gac ccc gag ttc ctt cag agt gtc eta gag aac etc cca ggt gtg 1179 

Gin Asp Pro Glu Phe Leu Gin Ser Val Leu Glu Asn Leu Pro Gly Val 

330 335 340 345 

45 

gat ccc aac aat gaa gec att cga aat get atg ggc tec ctg gec tec 1227 

Asp Pro Asn Asn Glu Ala He Arg Asn Ala Met Gly Ser Leu Ala Ser 
350 355 360 

50 cag gec acc aag gac ggc aag aag gac aag aag gag gaa gac aag aag 1275 
Gin Ala Thr Lys Asp Gly Lys Lys Asp Lys Lys Glu Glu Asp Lys Lys 
365 370 375 



55 



tga gactggaggg aaagggtagc tgagtctget tagggactgc atgggggaat tc 1330 



<210> 16 



<211> 377 

60 
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<212> PRT 

<213> Homo sapiens 

5 

<400> 16 

Met Val Leu Glu Ser Thr Met Val Cys Val Asp Asn Ser Glu Tyr Met 
15 10 15 

10 

Arg Asn Gly Asp Phe Leu Pro Thr Arg Leu Gin Ala Gin Gin Asp Ala 

20 25 .iO 

v -.-- Asn lie Val Cys His Sox Lys Thr Arg Ser Asn Pro Gjlu Asn 
35 40 45 



20 Val Gly Leu lie Thr Leu Ala Asn Asp Cys Glu Val Leu Thr Thr Leu 
50 55 60 



Thr Pro Asp Thr Gly Arg lie Leu Ser Lys Leu His Thr Val Gin Pro 
25 65 70 75 80 



Lys Gly Lys lie Thr Phe Cys Thr Gly lie Arg Val Ala His Leu Ala 
85 90 95 

30 

Leu Lys His Arg Gin Gly Lys Asn His Lys Met Arg lie lie Ala Phe 
100 105 110 

35 

Val Gly Ser Pro Val Glu Asp Asn Glu Lys Asp Leu Val Lys Leu Ala 
115 120 125 



40 Lys Arg Leu Lys Lys Glu Lys Val Asn Val Asp lie lie Asn Phe Gly 
130 135 140 



Glu Glu Glu Val Asn Thr Glu Lys Leu Thr Ala Phe Val Asn Thr Leu 
45 145 150 155 160 



Asn Gly Lys Asp Gly Thr Gly Ser His Leu Val Thr Val Pro Pro Gly 
165 170 175 

50 

Pro Ser Leu Ala Asp Ala Leu lie Ser Ser Pro lie Leu Ala Gly Glu 
180 185 190 

55 

Gly Gly Ala Met Leu Gly Leu Gly Ala Ser Asp Phe Glu Phe Gly Val 
195 200 205 



60 Asp Pro Ser Ala Asp Pro Glu Leu Ala Leu Ala Leu Arg Val Ser Met 
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210 



215 



220 



Glu Glu Gin Arg Gin Arg Gin Glu Glu Glu Ala Arg Arg Ala Ala Ala 
5 225 230 235 240 



10 



Ala Ser Ala Ala Glu Ala Gly lie Ala Thr Thr Gly Thr Glu Asp Ser 
245 250 255 



Asp Asp Ala Leu Leu Lys Met Thr lie Ser Gin Gin Glu Phe Gly Arg 

2 60 265 270 



Thr Gly Leu Pro A~p Leu Ser Ser Met Thr Glu Glu Glu Gin lie Ala 
275 280 285 



20 Tyr Ala Met Gin Met Ser Leu Gin Gly Ala Glu Phe Gly Gin Ala Glu 
290 295 300 



Ser Ala Asp lie Asp Ala Ser Ser Ala Met Asp Thr Ser Glu Pro Ala 
25 305 310 315 320 



30 



Lys Glu Glu Asp Asp Tyr Asp Val Met Gin Asp Pro Glu Phe Leu Gin 
325 330 335 



Ser Val Leu Glu Asn Leu Pro Gly Val Asp Pro Asn Asn Glu Ala lie 
340 345 350 



35 



Arg Asn Ala Met Gly Ser Leu Ala Ser Gin Ala Thr Lys Asp Gly Lys 
355 360 365 



40 Lys Asp Lys Lys Glu Glu Asp Lys Lys 
370 375 



